1. INTRODUCTION


The  Summer  Research  Program  (SRP),  sponsored  by  the  Air  Force  Office  of  Scientific  Research 
(AFOSR),  offers  paid  opportunities  for  university  faculty,  graduate  students,  and  high  school  students 
to conduct research in U.S. Air Force research laboratories nationwide during the summer.

Introduced  by  AFOSR  in  1978,  this  innovative  program  is  based  on  the  concept  of  teaming  academic 
researchers  with  Air  Force  scientists  in  the  same  disciplines  using  laboratory  facilities  and  equipment 
not  often  available  at  associates'  institutions. 

AFOSR  also  offers  its  research  associates  an  opportunity,  under  the  Summer  Research  Extension 
Program  (SREP),  to  continue  their  AFOSR-sponsored  research  at  their  home  institutions  through  the 
award of research grants. In 1994 the maximum amount of each grant was increased from $20,000 to
$25,000, and the number of AFOSR-sponsored grants decreased from 75 to 60. A separate annual
report  is  compiled  on  the  SREP. 

The  Summer  Faculty  Research  Program  (SFRP)  is  open  annually  to  approximately  150  faculty 
members  with  at  least  two  years  of  teaching  and/or  research  experience  in  accredited  U.S.  colleges, 
universities,  or  technical  institutions.  SFRP  associates  must  be  either  U.S.  citizens  or  permanent 
residents. 

The  Graduate  Student  Research  Program  (GSRP)  is  open  annually  to  approximately  100  graduate 
students  holding  a  bachelor's  or  a  master's  degree;  GSRP  associates  must  be  U.S.  citizens  enrolled  full 
time  at  an  accredited  institution. 

The  High  School  Apprentice  Program  (HSAP)  annually  selects  about  125  high  school  students  located 
within  a  twenty  mile  commuting  distance  of  participating  Air  Force  laboratories. 

The  numbers  of  projected  summer  research  participants  in  each  of  the  three  categories  are  usually 
increased  through  direct  sponsorship  by  participating  laboratories. 

AFOSR's SRP has well served its objectives of building critical links between Air Force research
laboratories and the academic community; opening avenues of communication and forging new
research relationships between Air Force and academic technical experts in areas of national interest;
and strengthening the nation's efforts to sustain careers in science and engineering. The success of the
SRP can be gauged from its growth since inception (see Table 1) and from the favorable responses the
1994 participants expressed in end-of-tour SRP evaluations (Appendix B).

AFOSR  contracts  for  administration  of  the  SRP  by  civilian  contractors.  The  contract  was  first 
awarded  to  Research  &  Development  Laboratories  (RDL)  in  September  1990.  After  completion  of 
the  1990  contract,  RDL  won  the  recompetition  for  the  basic  year  and  four  1-year  options. 
2. PARTICIPATION IN THE SUMMER RESEARCH PROGRAM

The  SRP  began  with  faculty  associates  in  1979;  graduate  students  were  added  in  1982  and  high  school 
students  in  1986.  The  following  table  shows  the  number  of  associates  in  the  program  each  year. 


Table 1: SRP Participation, by Year

              Number of Participants
YEAR     SFRP     GSRP     HSAP     TOTAL
1979       70        -        -        70
1980       87        -        -        87
1981       87        -        -        87
1982       91       17        -       108
1983      101       53        -       154
1984      152       84        -       236
1985      154       92        -       246
1986      158      100       42       300
1987      159      101       73       333
1988      153      107      101       361
1989      168      102      103       373
1990      165      121      132       418
1991      170      142      132       444
1992      185      121      159       464
1993      187      117      136       440
1994      192      117      133       442

Beginning in 1993, due to budget cuts, some of the laboratories weren't able to afford to fund as many
associates as in previous years; in one case a laboratory did not fund any additional associates.
However, the table shows that, overall, the number of participating associates increased this year
because two laboratories funded more associates than they had in previous years.
3. RECRUITING AND SELECTION


The  SRP  is  conducted  on  a  nationally  advertised  and  competitive-selection  basis.  The  advertising  for 
faculty  and  graduate  students  consisted  primarily  of  the  mailing  of  8,000  44-page  SRP  brochures  to 
chairpersons  of  departments  relevant  to  AFOSR  research  and  to  administrators  of  grants  in  accredited 
universities,  colleges,  and  technical  institutions.  Historically  Black  Colleges  and  Universities  (HBCUs) 
and Minority Institutions (MIs) were included. Brochures also went to all participating USAF
laboratories,  the  previous  year's  participants,  and  numerous  (over  600  annually)  individual  requesters. 

Due to a delay in awarding the new contract, RDL was not able to place advertisements in any of the
following publications in which the SRP is normally advertised: Black Issues in Higher Education,
Chemical & Engineering News, IEEE Spectrum, and Physics Today.

High  school  applicants  can  participate  only  in  laboratories  located  no  more  than  20  miles  from  their 
residence.  Tailored  brochures  on  the  HSAP  were  sent  to  the  head  counselors  of  180  high  schools  in 
the  vicinity  of  participating  laboratories,  with  instructions  for  publicizing  the  program  in  their  schools. 
High  school  students  selected  to  serve  at  Wright  Laboratory's  Armament  Directorate  (Eglin  Air  Force 
Base,  Florida)  serve  eleven  weeks  as  opposed  to  the  eight  weeks  normally  worked  by  high  school 
students  at  all  other  participating  laboratories. 

Each  SFRP  or  GSRP  applicant  is  given  a  first,  second,  and  third  choice  of  laboratory.  High  school 
students  who  have  more  than  one  laboratory  or  directorate  near  their  homes  are  also  given  first, 
second,  and  third  choices. 

Laboratories  make  their  selections  and  prioritize  their  nominees.  AFOSR  then  determines  the  number 
to  be  funded  at  each  laboratory  and  approves  laboratories'  selections. 

Subsequently,  laboratories  use  their  own  funds  to  sponsor  additional  candidates.  Some  selectees  do 
not  accept  the  appointment,  so  alternate  candidates  are  chosen.  This  multi-step  selection  procedure 
results  in  some  candidates  being  notified  of  their  acceptance  after  scheduled  deadlines.  The  total 
applicants  and  participants  for  1994  are  shown  in  this  table. 


Table 2: 1994 Applicants and Participants

PARTICIPANT     TOTAL                     DECLINING
CATEGORY        APPLICANTS    SELECTEES   SELECTEES
SFRP               600           192          30
  (HBCU/MI)        (90)          (16)         (7)
GSRP               322           117          11
  (HBCU/MI)        (11)           (6)         (0)
HSAP               562           133          14
TOTAL             1484           442          55
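
The Table 2 figures can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the original report: it re-enters the three category rows, recomputes the column totals, and derives the share of applicants in each category that appears in the Selectees column. The names `table2`, `column_total`, and `selection_rate` are hypothetical.

```python
# Figures transcribed from Table 2 (1994 applicants and participants).
# The dictionary layout and function names are assumptions made for this sketch.
table2 = {
    "SFRP": {"applicants": 600, "selectees": 192, "declined": 30},
    "GSRP": {"applicants": 322, "selectees": 117, "declined": 11},
    "HSAP": {"applicants": 562, "selectees": 133, "declined": 14},
}


def column_total(table, column):
    """Sum one column across all participant categories."""
    return sum(row[column] for row in table.values())


def selection_rate(row):
    """Fraction of applicants listed in the Selectees column."""
    return row["selectees"] / row["applicants"]


totals = {
    column: column_total(table2, column)
    for column in ("applicants", "selectees", "declined")
}

if __name__ == "__main__":
    for name, row in table2.items():
        print(f"{name}: {selection_rate(row):.1%} of applicants selected")
    print("Column totals:", totals)
```

The recomputed totals agree with the TOTAL row of Table 2 (1484 applicants, 442 selectees, 55 declining selectees).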
4. SITE VISITS


During June and July of 1994, representatives of both AFOSR/NI and RDL visited each participating
laboratory to provide briefings, answer questions, and resolve problems for both laboratory personnel
and participants. The objective was to ensure that the SRP would be as constructive as possible for all
participants. Both SRP participants and RDL representatives found these visits beneficial. At many of
the laboratories, this was the only opportunity for all participants to meet at one time to share their
experiences and exchange ideas.


5. HISTORICALLY BLACK COLLEGES AND UNIVERSITIES AND MINORITY
INSTITUTIONS (HBCU/MIs)

In  previous  years,  an  RDL  program  representative  visited  from  seven  to  ten  different  HBCU/MIs  to 
promote  interest  in  the  SRP  among  the  faculty  and  graduate  students.  Due  to  the  late  contract  award 
date  (January  1994)  no  time  was  available  to  visit  HBCU/MIs  this  past  year. 

In  addition  to  RDL's  special  recruiting  efforts,  AFOSR  attempts  each  year  to  obtain  additional  funding 
or  use  leftover  funding  from  cancellations  the  past  year  to  fund  HBCU/MI  associates.  This  year,  seven 
HBCU/MI  SFRPs  declined  after  they  were  selected.  The  following  table  records  HBCU/MI 
participation  in  this  program. 


Table 3: SRP HBCU/MI Participation, by Year

               SFRP                        GSRP
YEAR    Applicants  Participants    Applicants  Participants
1985        76          23              15          11
1986        70          18              20          10
1987        82          32              32          10
1988        53          17              23          14
1989        39          15              13           4
1990        43          14              17           3
1991        42          13               8           5
1992        70          13               9           5
1993        60          13               6           2
1994        90          16              11           6
6. SRP FUNDING SOURCES


Funding  sources  for  the  1994  SRP  were  the  AFOSR-provided  slots  for  the  basic  contract  and 
laboratory  funds.  Funding  sources  by  category  for  the  1994  SRP  selected  participants  are  shown  here. 

Table 4: 1994 SRP Associate Funding

FUNDING CATEGORY                     SFRP     GSRP     HSAP
AFOSR Basic Allocation Funds          150      98*1    121*2
USAF Laboratory Funds                  37      19       12
HBCU/MI By AFOSR
(Using Procured Addn'l Funds)           5       0        0
TOTAL                                 192     117      133

*1 - 100 were selected, but two canceled too late to be replaced.
*2 - 125 were selected, but four canceled too late to be replaced.


7.  COMPENSATION  FOR  PARTICIPANTS 


Compensation  for  SRP  participants,  per  five-day  work  week,  is  shown  in  this  table. 


Table 5: 1994 SRP Associate Compensation

PARTICIPANT CATEGORY                      1991     1992     1993     1994
Faculty Members                           $690     $718     $740     $740
Graduate Student (Master's Degree)        $425     $442     $455     $455
Graduate Student (Bachelor's Degree)      $365     $380     $391     $391
High School Student (First Year)          $200     $200     $200     $200
High School Student (Subsequent Years)    $240     $240     $240     $240
The program also offered associates whose homes were more than 50 miles from the laboratory an
expense allowance (seven days per week) of $50/day for faculty and $37/day for graduate students.
Transportation to the laboratory at the beginning of their tour and back to their home destinations at
the end was also reimbursed for these participants. Of the combined SFRP and GSRP associates, 58%
(178 out of 309) claimed travel reimbursements at an average round-trip cost of $860.

Faculty  members  were  encouraged  to  visit  their  laboratories  before  their  summer  tour  began.  All  costs 
of  these  orientation  visits  were  reimbursed.  Forty-one  percent  (78  out  of  192)  of  faculty  associates 
took  orientation  trips  at  an  average  cost  of  $498.  Many  faculty  associates  noted  on  their  evaluation 
forms  that  due  to  the  late  notice  of  acceptance  into  the  1994  SRP  (caused  by  the  late  award  in  January 
1994  of  the  contract)  there  wasn’t  enough  time  to  attend  an  orientation  visit  prior  to  their  tour  start 
date. In 1993, 58% of SFRP associates took orientation visits at an average cost of $685.

Program  participants  submitted  biweekly  vouchers  countersigned  by  their  laboratory  research  focal 
point,  and  RDL  issued  paychecks  so  as  to  arrive  in  associates'  hands  two  weeks  later. 

HSAP  program  participants  were  considered  actual  RDL  employees,  and  their  respective  state  and 
federal  income  tax  and  Social  Security  were  withheld  from  their  paychecks.  By  the  nature  of  their 
independent  research,  SFRP  and  GSRP  program  participants  were  considered  to  be  consultants  or 
independent  contractors.  As  such,  SFRP  and  GSRP  associates  were  responsible  for  their  own  income 
taxes, Social Security, and insurance.


8.  CONTENTS  OF  THE  1994  REPORT 


The  complete  set  of  reports  for  the  1994  SRP  includes  this  program  management  report  augmented  by 
fifteen  volumes  of  final  research  reports  by  the  1994  associates  as  indicated  below: 


Table 6: 1994 SRP Final Report Volume Assignments

                                VOLUME
LABORATORY              SFRP      GSRP     HSAP
Armstrong               2         7        12
Phillips                3         8        13
Rome                    4         9        14
Wright                  5A, 5B    10       15
AEDC, FJSRL, WHMC       6         11       16

AEDC  = Arnold Engineering Development Center
FJSRL = Frank J. Seiler Research Laboratory
WHMC  = Wilford Hall Medical Center
APPENDIX A - PROGRAM STATISTICAL SUMMARY


A. Colleges/Universities Represented

Selected SFRP and GSRP associates represent 158 different colleges, universities, and
institutions. 


B.  States  Represented 

SFRP - Applicants came from 46 states plus Washington, D.C. and Puerto Rico. Selectees
represent  40  states. 

GSRP  -  Applicants  came  from  46  states  and  Puerto  Rico.  Selectees  represent  34  states. 

HSAP  -  Applicants  came  from  fifteen  states.  Selectees  represent  ten  states. 


C.  Academic  Disciplines  Represented 

The academic disciplines of the combined 192 SFRP associates are as follows:

Electrical Engineering                    22.4%
Mechanical Engineering                    14.0%
Physics: General, Nuclear & Plasma        12.2%
Chemistry & Chemical Engineering          11.2%
Mathematics & Statistics                   8.1%
Psychology                                 7.0%
Computer Science                           6.4%
Aerospace & Aeronautical Engineering       4.8%
Engineering Science                        2.7%
Biology & Inorganic Chemistry              2.2%
Physics: Electro-Optics & Photonics        2.2%
Communication                              1.6%
Industrial & Civil Engineering             1.6%
Physiology                                 1.1%
Polymer Science                            1.1%
Education                                  0.5%
Pharmaceutics                              0.5%
Veterinary Medicine                        0.5%
TOTAL                                      100%
Table A-1. Total Participants

Number of Participants
SFRP       192
GSRP       117
HSAP       133
TOTAL      442

Table A-2. Degrees Represented

Degrees Represented    SFRP    GSRP    TOTAL
Doctoral                189       0      189
Master's                  3      47       50
Bachelor's                0      70       70
TOTAL                   192     117      309

Table A-3. SFRP Academic Titles

Academic Titles
Assistant Professor       74
Associate Professor       63
Professor                 44
Instructor                 5
Chairman                   1
Visiting Professor         1
Visiting Assoc. Prof.      1
Research Associate
TOTAL                    192
Table A-4. Source of Learning About SRP

                                           SFRP                    GSRP
SOURCE                              Applicants  Selectees   Applicants  Selectees
Applied/participated in prior years    26%         37%         10%         13%
Colleague familiar with SRP            19%         17%         12%         12%
Brochure mailed to institution         32%         18%         19%         12%
Contact with Air Force laboratory      15%         24%          9%         12%
Faculty Advisor (GSRPs Only)            -           -          39%         43%
Other source                            8%          4%         11%          8%
TOTAL                                 100%        100%        100%        100%

Table A-5. Ethnic Background of Applicants and Selectees

                                         SFRP                    GSRP                    HSAP
                                  Applicants  Selectees   Applicants  Selectees   Applicants  Selectees
American Indian or Native Alaskan    0.2%        0%          1%          0%         0.4%        0%
Asian/Pacific Islander               30%        20%          6%          8%          7%        10%
Black                                 4%       1.5%          3%          3%          7%         2%
Hispanic                              3%       1.9%          4%        4.5%         11%         8%
Caucasian                            51%        63%         77%         77%         70%        75%
Preferred not to answer              12%        14%          9%          7%          4%         5%
TOTAL                               100%       100%        100%        100%         99%       100%

Table A-6. Percentages of Selectees Receiving Their 1st, 2nd, or 3rd Choices of Directorate

          1st        2nd        3rd        Other Than
          Choice     Choice     Choice     Their Choice
SFRP       70%         7%         3%          20%
GSRP       76%         2%         2%          20%
APPENDIX B - SRP EVALUATION RESPONSES


1. OVERVIEW

Evaluations  were  completed  and  returned  to  RDL  by  four  groups  at  the  completion  of  the  SRP.  The 
number  of  respondents  in  each  group  is  shown  below. 


Table B-1. Total SRP Evaluations Received

Evaluation Group                    Responses
SFRP & GSRPs                           275
HSAPs                                  116
USAF Laboratory Focal Points           109
USAF Laboratory HSAP Mentors            54

All  groups  indicate  near-unanimous  enthusiasm  for  the  SRP  experience. 


Typical comments from 1994 SRP associates are:

"[The SRP was an] excellent opportunity to work in state-of-the-art facility with top-notch
people."

"[The  SRP  experience]  enabled  exposure  to  interesting  scientific  application  problems; 
enhancement  of  knowledge  and  insight  into  'real-world'  problems." 

"[The  SRP]  was  a  great  opportunity  for  resourceful  and  independent  faculty  [members]  from 
small  colleges  to  obtain  research  credentials." 

"The  laboratory  personnel  I  worked  with  are  tremendous,  both  personally  and  scientifically.  I 
cannot  emphasize  how  wonderful  they  are." 

"The  one-on-one  relationship  with  my  mentor  and  the  hands  on  research  experience  improved 
[my]  understanding  of  physics  in  addition  to  improving  my  library  research  skills.  Very 
valuable  for  [both]  college  and  career!" 
Typical comments from laboratory focal points and mentors are:

"This program [AFOSR - SFRP] has been a 'God Send' for us. Ties established with summer
faculty have proven invaluable."

"Program was excellent from our perspective. So much was accomplished that new options
became viable."

"This program managed to get around most of the red tape and 'BS' associated with most Air
Force programs. Good Job!"

"Great program for high school students to be introduced to the research environment. Highly
educational for others [at laboratory]."

"This is an excellent program to introduce students to technology and give them a feel for
[science/engineering] career fields. I view any return benefit to the government to be 'icing on
the cake' and have usually benefitted."

The  summarized  recommendations  for  program  improvement  from  both  associates  and  laboratory 
personnel  are  listed  below  (Note:  basically  the  same  as  in  previous  years.) 

A.  Better  preparation  on  the  labs’  part  prior  to  associates'  arrival  (i.e.,  office  space, 
computer  assets,  clearly  defined  scope  of  work). 

B.  Laboratory  sponsor  seminar  presentations  of  work  conducted  by  associates,  and/or 
organized  social  functions  for  associates  to  collectively  meet  and  share  SRP 
experiences. 

C.  Laboratory  focal  points  collectively  suggest  more  AFOSR  allocated  associate 
positions,  so  that  more  people  may  share  in  the  experience. 

D.  Associates  collectively  suggest  higher  stipends  for  SRP  associates. 

E. Both HSAP Air Force laboratory mentors and associates would like the summer tour
extended from the current 8 weeks to either 10 or 11 weeks; the groups state it takes
4-6 weeks just to get high school students up to speed on what's going on at the
laboratory. (Note: this same argument was used to raise the faculty and graduate
student participation time a few years ago.)




2.  1994  USAF  LABORATORY  FOCAL  POINT  (LFP)  EVALUATION  RESPONSES 


The  summarized  results  listed  below  are  from  the  109  LFP  evaluations  received. 

1. LFP evaluations received and associate preferences:

Table B-2. Air Force LFP Evaluation Responses (By Type)

How Many Associates Would You Prefer To Get? (% Response)

                            SFRP             GSRP (w/ Univ Professor)   GSRP (w/o Univ Professor)
Lab     Evals Recv'd    0    1    2   3+        0    1    2   3+            0    1    2   3+
AEDC        10         30   50    0   20       50   40    0   10           40   60    0    0
AL          44         34   50    6    9       54   34   12    0           56   31   12    0
FJSRL        3         33   33   33    0       67   33    0    0           33   67    0    0
PL          14         28   43   28    0       57   21   21    0           71   28    0    0
RL           3         33   67    0    0       67    0   33    0          100    0    0    0
WHMC         1          0    0  100    0        0  100    0    0            0  100    0    0
WL          46         15   61   24    0       56   30   13    0           76   17    6    0
Total      121        25%  43%  27%   4%      50%  37%  11%   1%          54%  43%   3%   0%

LFP Evaluation Summary. The summarized responses, by laboratory, are listed on the following
page. LFPs were asked to rate the following questions on a scale from 1 (below average) to 5 (above
average).

2. LFPs involved in SRP associate application evaluation process:
   a. Time available for evaluation of applications:
   b. Adequacy of applications for selection process:

3. Value of orientation trips:

4. Length of research tour:

5. a. Benefits of associate's work to laboratory:
   b. Benefits of associate's work to Air Force:

6. a. Enhancement of research qualifications for LFP and staff:
   b. Enhancement of research qualifications for SFRP associate:
   c. Enhancement of research qualifications for GSRP associate:

7. a. Enhancement of knowledge for LFP and staff:
   b. Enhancement of knowledge for SFRP associate:
   c. Enhancement of knowledge for GSRP associate:

8. Value of Air Force and university links:

9. Potential for future collaboration:

10. a. Your working relationship with SFRP:
    b. Your working relationship with GSRP:

11. Expenditure of your time worthwhile:

12. Quality of program literature for associate:

13. a. Quality of RDL's communications with you:
    b. Quality of RDL's communications with associates:

14. Overall assessment of SRP:


Laboratory Focal Point Responses to above questions

                 AEDC     AL     FJSRL     PL      RL     WHMC       WL
# Evals Recv'd    10      32       3       14       3       1        46
Question #
2                 90%     62%    100%     64%    100%    100%       83%
2a                3.5     3.5     4.7     4.4     4.0     4.0       3.7
2b                4.0     3.8     4.0     4.3     4.3     4.0       3.9
3                 4.2     3.6     4.3     3.8     4.7     4.0       4.0
4                 3.8     3.9     4.0     4.2     4.3   NO ENTRY    4.0
5a                4.1     4.4     4.7     4.9     4.3     3.0       4.6
5b                4.0     4.2     4.7     4.7     4.3     3.0       4.5
6a                3.6     4.1     3.7     4.5     4.3     3.0       4.1
6b                3.6     4.0     4.0     4.4     4.7     3.0       4.2
6c                3.3     4.2     4.0     4.5     4.5     3.0       4.2
7a                3.9     4.3     4.0     4.6     4.0     3.0       4.2
7b                4.1     4.3     4.3     4.6     4.7     3.0       4.3
7c                3.3     4.1     4.5     4.5     4.5     5.0       4.3
8                 4.2     4.3     5.0     4.9     4.3     5.0       4.7
9                 3.8     4.1     4.7     5.0     4.7     5.0       4.6
10a               4.6     4.5     5.0     4.9     4.7     5.0       4.7
10b               4.3     4.2     5.0     4.3     5.0     5.0       4.5
11                4.1     4.5     4.3     4.9     4.7     4.0       4.4
12                4.1     3.9     4.0     4.4     4.7     3.0       4.1
13a               3.8     2.9     4.0     4.0     4.7     3.0       3.6
13b               3.8     2.9     4.0     4.3     4.7     3.0       3.8
14                4.5     4.4     5.0     4.9     4.7     4.0       4.5
3. 1994 SFRP & GSRP EVALUATION RESPONSES


The  summarized  results  listed  below  are  from  the  275  SFRP/GSRP  evaluations  received. 


Associates were asked to rate the following questions on a scale from
1 (below average) to 5 (above average).

1. The match between the laboratory's research and your field:      4.6
2. Your working relationship with your LFP:                         4.8
3. Enhancement of your academic qualifications:                     4.4
4. Enhancement of your research qualifications:                     4.5
5. Lab readiness for you: LFP, task, plan:                          4.3
6. Lab readiness for you: equipment, supplies, facilities:          4.1
7. Lab resources:                                                   4.3
8. Lab research and administrative support:                         4.5
9. Adequacy of brochure and associate handbook:                     4.3
10. RDL communications with you:                                    4.3
11. Overall payment procedures:                                     3.8
12. Overall assessment of the SRP:                                  4.7

13. a. Would you apply again?                                 Yes:  85%
    b. Will you continue this or related research?            Yes:  95%

14. Was length of your tour satisfactory?                     Yes:  86%

15. Percentage of associates who engaged in:
    a. Seminar presentation:                                        52%
    b. Technical meetings:                                          32%
    c. Social functions:                                             3%
    d. Other:                                                        1%
16. Percentage of associates who experienced difficulties in:
    a. Finding housing:                                             12%
    b. Check cashing:                                                3%

17. Where did you stay during your SRP tour?
    a. At Home:                                                     20%
    b. With Friend:                                                  6%
    c. On Local Economy:                                            47%
    d. Base Quarters:                                               10%

THIS SECTION FACULTY ONLY:

18. Were graduate students working with you?                  Yes:  23%

19. Would you bring graduate students next year?              Yes:  56%

20. Value of orientation visit:
    Essential:                                                      29%
    Convenient:                                                     20%
    Not Worth Cost:                                                  1%
    Not Used:                                                       34%

THIS SECTION GRADUATE STUDENTS ONLY:

21. Who did you work with:
    University Professor:                                           18%
    Laboratory Scientist:                                           54%
4. 1994 USAF LABORATORY HSAP MENTOR EVALUATION RESPONSES


The  summarized  results  listed  below  are  from  the  54  mentor  evaluations  received. 


1. Mentor apprentice preferences:

Table B-3. Air Force Mentor Responses

How Many Apprentices Would You Prefer To Get?

                                HSAP Apprentices Preferred
Laboratory   # Evals Recv'd      0      1      2     3+
AEDC               6             0    100      0      0
AL                17            29     47      6     18
PL                 9            22     78      0      0
RL                 4            25     75      0      0
WL                18            22     55     17      6
Total             54           20%    71%     5%     5%

Mentors were asked to rate the following questions on a scale from
1 (below average) to 5 (above average).

2. Mentors involved in SRP apprentice application evaluation process:
   a. Time available for evaluation of applications:
   b. Adequacy of applications for selection process:

3. Laboratory's preparation for apprentice:

4. Mentor's preparation for apprentice:

5. Length of research tour:

6. Benefits of apprentice's work to U.S. Air Force:

7. Enhancement of academic qualifications for apprentice:

8. Enhancement of research skills for apprentice:

9. Value of U.S. Air Force/high school links:

10. Mentor's working relationship with apprentice:

11. Expenditure of mentor's time worthwhile:

12. Quality of program literature for apprentice:

13. a. Quality of RDL's communications with mentors:
    b. Quality of RDL's communication with apprentices:

14. Overall assessment of SRP:
                 AEDC      AL      PL      RL      WL
# Evals Recv'd     6       17       9       4      18
Question #
2                100%     76%     56%     15%     61%
2a                4.2     4.0     3.1     3.7     3.5
2b                4.0     4.5     4.0     4.0     3.8
3                 4.3     3.8     3.9     3.8     3.8
4                 4.5     3.7     3.4     4.2     3.9
5                 3.5     4.1     3.1     3.7     3.6
6                 4.3     3.9     4.0     4.0     4.2
7                 4.0     4.4     4.3     4.2     3.9
8                 4.7     4.4     4.4     4.2     4.0
9                 4.7     4.2     3.7     4.5     4.0
10                4.7     4.5     4.4     4.5     4.2
11                4.8     4.3     4.0     4.5     4.1
12                4.2     4.1     4.1     4.8     3.4
13a               3.5     3.9     3.7     4.0     3.1
13b               4.0     4.1     3.4     4.0     3.5
14                4.3     4.5     3.8     4.5     4.1
5. 1994 HSAP EVALUATION RESPONSES


The  summarized  results  listed  below  are  from  the  116  HSAP  evaluations  received. 

HSAP apprentices were asked to rate the following questions on a scale from
1 (below average) to 5 (above average).

1. Match of lab research to your interest:                                        3.9
2. Apprentice's working relationship with their mentor and other lab scientists:  4.6
3. Enhancement of your academic qualifications:                                   4.4
4. Enhancement of your research qualifications:                                   4.1
5. Lab readiness for you: mentor, task, work plan:                                3.7
6. Lab readiness for you: equipment, supplies, facilities:                        4.3
7. Lab resources: availability:                                                   4.3
8. Lab research and administrative support:                                       4.4
9. Adequacy of RDL's apprentice handbook and administrative materials:            4.0
10. Responsiveness of RDL's communications:                                       3.5
11. Overall payment procedures:                                                   3.3
12. Overall assessment of SRP value to you:                                       4.5

13. Would you apply again next year?                                        Yes:  88%

14. Was length of SRP tour satisfactory?                                    Yes:  78%

15. Percentages of apprentices who engaged in:
    a. Seminar presentation:                                                      48%
    b. Technical meetings:                                                        23%
    c. Social functions:                                                          18%
A PROGRAM PLAN FOR TRANSMITTING
HIGH-DATA-RATE ATM/SONET SIGNALS OVER THE ACTS


Valentine Aalo
Assistant Professor
Electrical Engineering Department
Florida Atlantic University


Okechukwu  Ugweje 
Electrical  Engineering  Department 
Florida  Atlantic  University 


Abstract 


The feasibility, desirability, and usefulness of Asynchronous Transfer Mode and Synchronous Optical
Network transmission protocols over the Advanced Communications Technology Satellite (ACTS) were
studied. A program plan for the transmission of Asynchronous Transfer Mode and Synchronous
Optical Network signals at high data rates via the ACTS satellite was developed for the U.S. Air Force
Rome Laboratory. The high data rate terminals will transmit and receive signals at DS-3 (45 Mbps) and
OC-3 (155 Mbps) over NASA's ACTS.

1  INTRODUCTION 

Communication systems have developed at a very rapid pace over the last decade. One of the
outcomes of this development is in the area of broadband packet switching. In the last few years, research
in this field has been very active. Hardly any current publication on switching networks can
be opened without a mention of ATM. ATM is now regarded as the communication protocol of the
future, and researchers are making every effort to find more effective uses of this new technology.

The objective of this program plan is to study the feasibility, desirability, and usefulness of using
ATM/SONET transmission protocols over NASA's Advanced Communications Technology Satellite
(ACTS) for multimedia applications at high data rates (HDR). This objective is a subset of a proposed high speed network
under development which will link the assets of the Air Force at Rome Laboratory (RL), the Army at
Communications Electronic Command (CECOM), and the Navy at Naval Research and Development
(NRaD). This network has the potential of becoming a worldwide broadband network connecting a large
number of remote tactical sites (e.g., U.S. Navy ships) with voice, video, and data. This objective will
then be integrated into the Technical Technology Cooperation Program (TTCP) network in which RL will
communicate via the satellite with the Canadians at the Communication Research Centre (CRC), NRaD
will communicate with the Australians, and CECOM with Great Britain [1].

The proposed experiment will consist of two phases. In the phase I experiment, ATM cells will be
transmitted at 45 Mbps between RL and CRC. The phase II experiment will be performed
between RL and the Joint Interoperability Test Center (JITC), Ft. Huachuca, Arizona, at 155 Mbps.

This report describes the program plan necessary to achieve these objectives. It documents the
equipment and satellite requirements, data requirements, test procedures and all relevant mathematical
calculations.

2  PRELIMINARIES 

2.1  Asynchronous  Transfer  Mode  (ATM) 

ATM is a broad-bandwidth, low-delay, packet-like switching and multiplexing protocol. It uses
constant-length packets, called cells, as the basic units: user information is carried in cells and
transferred asynchronously as it appears at the input of a communication system. ATM is essentially
connection oriented. Information is buffered as it arrives, inserted into an ATM cell when
there is enough to fill the cell, and then transported asynchronously across the network.

2.1.1  Basic Principles of ATM
Cell Structure:

    | Header (5 octets) | Information field (48 octets) |

1 octet = 1 byte; 1 cell = 53 octets = 424 bits; Header = 5 octets = 40 bits.

Two  header  cell  formats  have  been  specified. 

1.  User-Network  Interface  (UNI) 

This  header  format  is  used  for  user-to-ATM  connections. 

    Field:   GFC   VPI   VCI   PT   CLP   HEC
    Bits:      4     8    16    3     1     8


2.  Network-Node  Interface  (NNI) 

NNI  is  used  for  ATM-to-ATM  connections. 


where  GFC:  Generic Flow Control
       VPI:  Virtual Path Identifier
       VCI:  Virtual Channel Identifier
       PT:   Payload Type
       CLP:  Cell Loss Priority
       HEC:  Header Error Control
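
The UNI header layout above can be illustrated with a minimal Python sketch that packs the five fields into the first four octets and appends the HEC. The HEC rule used here (a CRC-8 with generator x^8 + x^2 + x + 1 over the first four octets, XORed with 0x55) is taken from ITU-T Recommendation I.432 and is not stated in this report; the function names are illustrative only.

```python
def hec(first4: bytes) -> int:
    """CRC-8 (x^8 + x^2 + x + 1) over four header octets, XORed with 0x55."""
    crc = 0
    for byte in first4:
        crc ^= byte
        for _ in range(8):
            if crc & 0x80:
                crc = ((crc << 1) ^ 0x07) & 0xFF
            else:
                crc = (crc << 1) & 0xFF
    return crc ^ 0x55


def pack_uni_header(gfc: int, vpi: int, vci: int, pt: int, clp: int) -> bytes:
    """Pack GFC(4) VPI(8) VCI(16) PT(3) CLP(1), then append HEC(8): 40 bits."""
    for value, width in ((gfc, 4), (vpi, 8), (vci, 16), (pt, 3), (clp, 1)):
        if not 0 <= value < (1 << width):
            raise ValueError("field value does not fit its width")
    word = (gfc << 28) | (vpi << 20) | (vci << 4) | (pt << 1) | clp
    first4 = word.to_bytes(4, "big")
    return first4 + bytes([hec(first4)])


def unpack_uni_header(header: bytes) -> dict:
    """Check the HEC and split a 5-octet UNI header back into its fields."""
    if len(header) != 5 or hec(header[:4]) != header[4]:
        raise ValueError("bad length or HEC mismatch")
    word = int.from_bytes(header[:4], "big")
    return {
        "gfc": word >> 28,
        "vpi": (word >> 20) & 0xFF,
        "vci": (word >> 4) & 0xFFFF,
        "pt": (word >> 1) & 0x7,
        "clp": word & 0x1,
    }
```

For example, `pack_uni_header(0, 1, 1, 0, 1)` builds a header with the VPI, VCI, PT and cell loss priority values used later in the test procedure.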

In ATM, cells from more than one connection are multiplexed together, and at the receiving end the cell
sequence is preserved. The information field (payload) is transported transparently (i.e., with no error
correction) across the network by the ATM layer. Because ATM is a connection-oriented technique, a path
must be established between the users before any information can be exchanged.

ATM Layers:

The ATM layer is concerned with transporting information across the network. It uses virtual
connections for information transport, divided into the virtual path and the virtual channel. The ATM
layer performs functions such as generic flow control, cell header generation and extraction, cell
virtual identifier translation, and cell multiplexing and demultiplexing.

Due to the statistical behavior of the ATM network and the fact that a theoretically unlimited number of
virtual connections can share the same link, an ATM switch must synchronize to the
incoming cell stream and examine the header field of each cell in order to identify the virtual connection
and derive routing and control information for the switch network.

Another important aspect of the ATM protocol is the ATM Adaptation Layer (AAL). The ATM layer is
independent of the services it carries and relies on the AAL to adapt higher-level data into a format that
it can handle. The AAL performs the necessary mapping between the ATM layer and the higher layers.

Also, due to the payload independence, all the functions specific to the services are provided at the
boundary of the ATM network and are performed by the AAL. Within the AAL, data flow can be
corrupted by errors in transmission, or it can suffer cell delay variation as a result of variable delay in
buffers or congestion in the network, thereby leading to cell loss or misdelivery of cells [2]. Since the
services depend on the AAL type, the AAL supports multiple protocols to fit the needs of the user.

2.1.2  Properties of ATM
Merits 

1. ATM has the flexibility to match the rate of transmission to the rate of reception (i.e., variable bit
   rate). ATM can transport constant-bit-rate information at the same time as variable-bit-rate
   bursty data. It does this by using a different ATM adaptation layer for each service being
   carried. Information can be transmitted through existing copper wire as well as through optical
   fibers.

2. The transmission of cells is matched to the generation of information, so that no cells are
   wasted during periods of low source activity. This is known as statistical multiplexing.

3. ATM offers the ability to integrate various sources or media into one link. It can be applied to
   data, voice and video and may be used to integrate these applications.

Demerits 

1. Cell Delay Variation (CDV)

   This arises from the variable delays introduced into the network by the queues at the switches
   and multiplexers. This delay changes the expected gap between cells and is most
   acute in delay-sensitive services such as speech transmission.

2. Cell Assembly Delay (CAD)

   This arises because information from the source is buffered until there is sufficient information
   to fill a cell. The waiting time in the buffer depends on the arrival rate. This delay is also acute
   in delay-sensitive services.
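
As a rough illustration of cell assembly delay, the sketch below computes how long a constant-rate source takes to fill one cell payload. The 64 kbps voice rate is an assumed example, not a figure from this report, and the calculation ignores any adaptation-layer octets that would reduce the usable payload.

```python
ATM_PAYLOAD_OCTETS = 48  # 53-octet cell minus the 5-octet header


def cell_assembly_delay(source_rate_bps: float,
                        payload_octets: int = ATM_PAYLOAD_OCTETS) -> float:
    """Seconds needed for a constant-rate source to fill one cell payload."""
    if source_rate_bps <= 0:
        raise ValueError("source rate must be positive")
    return payload_octets * 8 / source_rate_bps


# Assumed example: a 64 kbps voice channel filling a full 48-octet payload.
delay_ms = cell_assembly_delay(64_000) * 1e3
print(f"cell assembly delay: {delay_ms:.1f} ms")
```

The delay falls in inverse proportion to the source rate, which is why low-rate, delay-sensitive services are affected most.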

2.1.3  Theoretical  Considerations  and  Practical  Limitations 

The theoretical analysis of ATM systems is beyond the scope of this report. However, we will
mention some important active research issues in ATM networks.

One of these issues is the cell loss performance of ATM networks. Cell loss probability has been
estimated by model simulations using Monte Carlo and Importance Sampling simulation methods. The
problem with these types of analysis is that additional restrictions are imposed on the statistical
behavior of the input and output queues of the ATM switch. This is because the ATM queuing system
introduces nonlinearity, large memory of random length, and complex
traffic conditions. Thus a more suitable model (beyond conventional queuing theory) needs to be
developed, in which the bias is applied to the conditional arrival probabilities.
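
To make the idea of estimating cell loss by simulation concrete, here is a minimal plain Monte Carlo sketch of a single finite buffer served at one cell per slot. The traffic model (independent sources, each emitting a cell per slot with a fixed probability) and all parameter names are assumptions for illustration; the sketch does not implement importance sampling, which would additionally bias the arrival probabilities and reweight the estimate.

```python
import random


def simulate_cell_loss(buffer_cells: int, n_sources: int, p_active: float,
                       n_slots: int, seed: int = 0) -> float:
    """Estimate the cell loss ratio of a slotted queue with a finite buffer."""
    rng = random.Random(seed)
    queue = offered = lost = 0
    for _ in range(n_slots):
        if queue > 0:
            queue -= 1  # the link serves one cell per slot
        # Each source independently offers one cell with probability p_active.
        arrivals = sum(rng.random() < p_active for _ in range(n_sources))
        offered += arrivals
        accepted = min(arrivals, buffer_cells - queue)
        lost += arrivals - accepted  # cells that find the buffer full
        queue += accepted
    return lost / offered if offered else 0.0


if __name__ == "__main__":
    for size in (2, 5, 10, 20):
        ratio = simulate_cell_loss(size, n_sources=8, p_active=0.1, n_slots=20_000)
        print(f"buffer = {size:2d} cells, estimated loss ratio = {ratio:.5f}")
```

Very small loss probabilities need very long runs with this direct approach, which is the motivation for the importance sampling methods mentioned above.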

Some of the practical aspects of ATM include the issue of generating and collecting ATM traffic using
the adapter cards, the vulnerability of ATM signals to burst errors, and the variation of the link BER
characterization as a function of transmit power on the uplink.


2.2  Advanced Communications Technology Satellite (ACTS)

Spectrum congestion resulting from increased demand for telecommunications services has forced
satellite system planners to consider operating at very high frequencies. ACTS is NASA's latest
experimental communication satellite. It was developed to investigate how to accommodate the
projected increase in telecommunications demand through effective utilization of the frequency
spectrum. ACTS operates in the Ka-band (30/20 GHz), where atmospheric distortions, especially rain
attenuation, are severe.

The on-orbit configuration of ACTS is shown in Figure 1 [3]. ACTS employs several elements not found
in conventional bent-pipe transponders. Among the many features of ACTS, the following are the most
significant:


Figure 1: ACTS On-orbit Configuration (labeled elements: Ka-band command, ranging, and telemetry
antennas; 7.2-ft, 30-GHz receiving antenna; 10.8-ft, 20-GHz transmitting antenna; 3.3-ft steerable
antenna; C-band omni antenna; dual subreflectors; overall dimensions 15.2 ft, 29.9 ft and 47.1 ft)


Multibeam Antenna (MBA)

ACTS employs multiple dynamically hopping spot beams. The MBA consists of two high-gain offset-fed
Cassegrain antenna systems: one for uplink reception in the 30 GHz band (2.2 m) with a nominal gain of
60.2 dB, and the other for downlink transmission in the 20 GHz band (3.3 m) with a nominal gain of 56.7 dB.
Each antenna has five ports: two Beam Forming Network (BFN) ports and three fixed-beam ports.
Any of the ports can be connected to the Baseband Processor through the waveguide switches.
The BFN ports serve the East and West hopping-beam families. The operation of these two
families is similar. Also connected to the West beam family is the steerable antenna, which can be
pointed at any location in the U.S.A. and Canada. It is possible to transmit and receive simultaneously
through the steerable antenna, but it can only access one ground location at a time. If the experiment
sites are not co-located, using the steerable beam for both uplink and downlink will not be possible.
Also, its main reflector is smaller, and thus a lower gain is achieved when the steerable antenna is used.

On-board  Switching  and  Processing  System 

There are two radio-frequency communication modes: the Baseband Processor (BBP) mode and the
Microwave Switch Matrix (MSM) mode. In the BBP mode the received signal is demodulated, decoded
or encoded if necessary, routed, and remodulated by the baseband processor and then retransmitted.
Modulation for the BBP must be serial MSK. The BBP can receive modulated signals at two different
transmission rates, but its output is limited to 100 megasymbols per second. It can accommodate 15 dB of
uplink  fade  and  6  dB  of  downlink  fade  and  still  achieve  bit  error  rate  of  10"®  [3]. 

For the MSM mode of operation, the received signals are transmitted back without processing. The MSM
is a programmable 4 x 4 switch that can connect a maximum of three transmitters to three receivers. It
can operate in two modes: the static mode or the dynamic mode. The static mode is equivalent to the
conventional "bent pipe" satellite mode of operation, in which the switch configuration is fixed. In this
mode, traffic can be FDMA or very high-burst-rate TDMA, and bandwidth is limited to 1 GHz. In the
dynamic mode, traffic is switched around rapidly, and the modulation method is limited to digital
modulation schemes such as BPSK or QPSK. MSM mode can support 12 dB (8 dB) of uplink (downlink)
fades  and  still  achieve  bit  error  rate  of  1 0'®. 

3  TEST  OBJECTIVE 

Rome Laboratory is part of the Secure Survivable Communications Network (SSCN) program, which has
the objective (among others) of determining the feasibility, desirability and usefulness of the ATM
protocol as the communication backbone in a tactical environment. To accomplish this objective, RL will
conduct a series of tests over satellite links carrying ATM cells.

Satellites provide a fast, low-cost communication infrastructure deployable in a tactical environment where
no terrestrial network exists. ATM allows access to all types of media (voice, data, video). A
combination of ATM and satellite provides an improved quality of service. This test will be an
investigative test with the objective of verifying the above assertions. Each phase of this test will be
further subdivided into two subtests.

3.1  Subtest I

Subtest I will characterize the satellite channel. Its purpose is to establish the
operating points of the channel and to support link budget analysis. To specify the transmission quality
of ATM cells over the ACTS at the rate of interest, the transmission performance obtained in conjunction
with the Earth Station Terminals (EST) must be determined. To determine this, the carrier-to-noise
ratio and the receiving system noise temperature in the presence and absence of rain are considered,
because the most limiting factor in Ka-band satellite transmission is rain fade.

Using  the  information  on  ACTS  and  the  EST  parameters,  link  budget  calculations  were  made.  These 
calculations  include  the  uplink  and  downlink  budget  for  RL  node,  the  estimated  uplink  and  downlink 
budget  for  CRC  and  the  uplink  and  downlink  budget  for  JITC. 
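
The kind of link budget calculation referred to above can be sketched in a few lines of Python. This is a simplified illustration, not the calculation performed for this plan: it uses the standard free-space path loss formula and the usual carrier-to-noise-density relation, and the EIRP, G/T, range and rain fade values in the example are hypothetical placeholders rather than ACTS or EST parameters.

```python
import math

SPEED_OF_LIGHT = 299_792_458.0   # m/s
BOLTZMANN_DBW = -228.6           # Boltzmann's constant in dBW/K/Hz


def free_space_path_loss_db(freq_hz: float, distance_m: float) -> float:
    """Free-space path loss, 20*log10(4*pi*d*f/c), in dB."""
    return 20 * math.log10(4 * math.pi * distance_m * freq_hz / SPEED_OF_LIGHT)


def cn0_dbhz(eirp_dbw: float, freq_hz: float, distance_m: float,
             g_over_t_dbk: float, rain_fade_db: float = 0.0) -> float:
    """Carrier-to-noise-density ratio (dB-Hz) for one hop of the link."""
    loss = free_space_path_loss_db(freq_hz, distance_m) + rain_fade_db
    return eirp_dbw - loss + g_over_t_dbk - BOLTZMANN_DBW


def cn_db(cn0: float, noise_bandwidth_hz: float) -> float:
    """Carrier-to-noise ratio (dB) in a given noise bandwidth."""
    return cn0 - 10 * math.log10(noise_bandwidth_hz)


if __name__ == "__main__":
    # Hypothetical 30 GHz uplink over a 38,000 km slant range.
    for fade in (0.0, 10.0):
        cn0 = cn0_dbhz(60.0, 30e9, 38_000e3, 20.0, rain_fade_db=fade)
        print(f"rain fade {fade:4.1f} dB -> C/N0 = {cn0:.1f} dB-Hz, "
              f"C/N in 45 MHz = {cn_db(cn0, 45e6):.1f} dB")
```

Each decibel of rain fade comes directly off the carrier-to-noise ratio, which is why the clear-sky and rain cases are both evaluated. A fuller treatment would also account for the increase in system noise temperature during rain.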

3.2  Subtest  II 

Subtest II will be concerned with the application of the ATM protocol over the satellite. To determine to
what extent ATM cells can be transmitted over the ACTS, file transfer, video teleconferencing (VTC),
or both will be used. Files (data files only) will be created and transmitted over the link. A copy of
the transmitted files will be kept for comparison with the received files. Using the assets of the network
group at RL and presumably at CRC and JITC, VTC will be established between any two nodes. The
success or failure of the VTC will be noted, and participating members will be asked to evaluate the
quality of the services.

File transfer and video teleconferencing will be carried out simultaneously, and if possible a voice
transmission will be integrated. The objective is to determine the multimedia capabilities of ATM
transmissions.

3.3  Data Acquisition and Analysis of Results

During link characterization, the cell loss rate, cell error rate and the link bit error rate will be determined
or computed from measured data. The data collection forms are given in Appendix B.
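
As a simple illustration of how these three quantities might be computed from recorded counts, consider the sketch below. The counter names are hypothetical; the actual quantities available depend on the test equipment and on the data collection forms.

```python
def link_statistics(cells_sent: int, cells_received: int, cells_errored: int,
                    bits_compared: int, bit_errors: int) -> dict:
    """Derive loss and error ratios from raw counts taken during a test run."""
    if cells_sent <= 0 or cells_received <= 0 or bits_compared <= 0:
        raise ValueError("counts must be positive")
    return {
        # Fraction of transmitted cells that never arrived.
        "cell_loss_ratio": (cells_sent - cells_received) / cells_sent,
        # Fraction of delivered cells that contained at least one error.
        "cell_error_ratio": cells_errored / cells_received,
        # Fraction of compared bits that were received in error.
        "bit_error_rate": bit_errors / bits_compared,
    }
```

For instance, `link_statistics(1000, 990, 5, 990 * 424, 42)` reports a cell loss ratio of 0.01 for a run in which 10 of 1000 cells were lost.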

The  transferred  files  are  compared  with  the  received  files  in  order  to  determine  the  following: 

a.  Percent  of  transferred  files  without  error  and  the  percent  of  errored  files. 

b.  For  the  errored  files,  the  severity  of  the  error. 

c.  Any  correlation  of  error  with  the  properties  of  the  transmission  data. 

All relevant information must be recorded on the data collection sheet.
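
The file comparison described above could be automated along the following lines. This is a sketch under stated assumptions: the functions are illustrative, files are compared as in-memory byte strings, and any length mismatch is counted as eight bit errors per missing or extra byte, which is a convention chosen here rather than one taken from this plan.

```python
def count_bit_errors(sent: bytes, received: bytes) -> int:
    """Number of differing bits between a transmitted file and its received copy."""
    errors = sum(bin(a ^ b).count("1") for a, b in zip(sent, received))
    # Assumption: every missing or extra byte counts as 8 bit errors.
    errors += 8 * abs(len(sent) - len(received))
    return errors


def summarize_transfers(pairs) -> dict:
    """Summarize a list of (sent, received) byte-string pairs."""
    bit_errors = [count_bit_errors(sent, received) for sent, received in pairs]
    total = len(bit_errors)
    if total == 0:
        raise ValueError("no files to compare")
    clean = sum(1 for e in bit_errors if e == 0)
    return {
        "percent_error_free": 100.0 * clean / total,
        "percent_errored": 100.0 * (total - clean) / total,
        "bit_errors_per_file": bit_errors,  # a measure of error severity
    }
```

The per-file error counts can then be set against file properties such as size or content type to look for the correlations listed in item c.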

For  the  VTC  case,  the  quality  of  the  video  transfer  must  be  determined.  The  participants  are  asked  to 
subjectively  evaluate  the  quality  of  service  and  record  their  observations. 

An important objective in any experiment is to collect information. A more important objective is
to be able to interpret accurately the meaning of the collected data. In this experiment, we expect ATM
cells to be lost, files to be corrupted and the link BER to rise above expectations. This experiment must
try to explain why these errors occur so that subsequent experiments can prevent or minimize
them.

To achieve this objective, the results will be discussed with reference to (a) BER, (b) causes of cell loss,
(c) limiting factors, and (d) the correlation of data to error. The issue of generating enough
data to fully load the channel should be discussed. If the channel is not fully loaded, the exact rate
at which the experiment was performed must be specified.

4  SETUP  AND  CONFIGURATION 


Figure  2:  Experiment  Setup  (USA  and  Canada) 

4.1  Experiment Description

The phase I experiment will link RL to CRC. The terminals at RL and CRC will transmit and receive ATM
signals at DS-3 rates over the ACTS. CRC will provide the two terminals for this experiment. RL has a
commitment from CRC to upgrade two of their old terminals (originally used with the Olympus satellite
at the DS-1 rate) to operate at DS-3 rates. It is our understanding that these two terminals can be
upgraded to be compatible with the ACTS. For this experiment, we suggest the following scenario.

                          ACTS

          RL, USA                        CRC, CANADA
(East Family Spot Beam #4E)            (Steerable Beam)

RL:            Latitude:   43° 13' 13.176" N
               Longitude:  75° 24' 34.003" W

Spot Beam 4E:  Latitude:   42° 75' N
               Longitude:  74° 86' W

CRC:           Latitude:   46.05° N
               Longitude:  77.20° W

The phase II experiment will link RL to JITC, with transmission at OC-3 rates. The two terminals
required will be provided by NASA. These are the HDR earth terminals
offered by NASA to experimenters in conjunction with the ACTS. For this experiment, we suggest the
following scenario.

                          ACTS

          RL, USA                         JITC, USA
(East Family Spot Beam #4E)         (West Family Spot Beam)

RL:            Latitude:   43° 13' 13.176" N
               Longitude:  75° 24' 34.003" W

JITC:          Latitude:   31° 34' 17" N
               Longitude:  110° 20' 38" W

Spot Beam 4E:  Latitude:   42° 75' N
               Longitude:  74° 86' W

West Scan Beam centered at Phoenix, AZ:
               Latitude:   30° 30' N
               Longitude:  112° W
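
Site coordinates such as those listed above feed directly into antenna pointing and slant-range estimates. The following Python sketch converts degrees, minutes and seconds to decimal degrees and computes the elevation angle and slant range to a geostationary satellite using a spherical-Earth approximation. The satellite longitude of 100° W is an assumption based on the published ACTS orbital slot and does not appear in this report; the radii are rounded textbook values.

```python
import math

EARTH_RADIUS_KM = 6378.0
GEO_ORBIT_RADIUS_KM = 42164.0
ACTS_LONGITUDE_W_DEG = 100.0  # assumed orbital slot, not given in this report


def dms_to_degrees(degrees: float, minutes: float, seconds: float = 0.0) -> float:
    """Convert degrees/minutes/seconds to decimal degrees."""
    return degrees + minutes / 60.0 + seconds / 3600.0


def look_geometry(lat_n_deg: float, lon_w_deg: float,
                  sat_lon_w_deg: float = ACTS_LONGITUDE_W_DEG):
    """Return (elevation in degrees, slant range in km) to a geostationary satellite."""
    lat = math.radians(lat_n_deg)
    dlon = math.radians(lon_w_deg - sat_lon_w_deg)
    cos_g = math.cos(lat) * math.cos(dlon)            # cosine of the central angle
    sin_g = math.sqrt(max(0.0, 1.0 - cos_g * cos_g))
    slant = math.sqrt(EARTH_RADIUS_KM ** 2 + GEO_ORBIT_RADIUS_KM ** 2
                      - 2.0 * EARTH_RADIUS_KM * GEO_ORBIT_RADIUS_KM * cos_g)
    elevation = math.degrees(
        math.atan2(cos_g - EARTH_RADIUS_KM / GEO_ORBIT_RADIUS_KM, sin_g))
    return elevation, slant


if __name__ == "__main__":
    rl_lat = dms_to_degrees(43, 13, 13.176)
    rl_lon = dms_to_degrees(75, 24, 34.003)
    elevation, slant = look_geometry(rl_lat, rl_lon)
    print(f"RL: elevation {elevation:.1f} deg, slant range {slant:.0f} km")
```

The slant range obtained this way is the distance that enters the free-space path loss term of the link budget.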

4.2  Phase  I  Experiment 

The proposed end-to-end block diagram of the phase I experiment is shown in Figure 3. As was indicated
earlier, the satellite transmission equipment and the terminals will be provided by the Canadians.

CRC is still working on the upgrade of this equipment and full information is not yet available. In the
absence of specific information on the terminals, we will assume the components shown in the figure.
The dotted boxes signify assumed components. Satellite data and applications will be provided via the
GTE SPANet ATM switch for the RL node. The SPANet switch at RL is already fully functional and
equipped with DS-3 modules. No information is available on the ATM switch at CRC. Since full
information is not available on the proposed Canadian terminals, we will assume a mirror image of the RL
equipment for CRC until specific information is made available.

Figure 3: Phase I Experiment

4.2.1  ATM  Switching  Equipment 

At RL, the ATM generation and switching equipment shown in Figure 3 is the responsibility of the
Network Group RL/C3BC. These components consist of the GTE SPANet switch, Device Service Units,
routers, LANs, and workstations. They are already installed and operable, and
are described below.

1. ATM Interface Hardware

The SUN workstations (W/S) at RL are equipped with ATM interface hardware to generate ATM cells.
Each W/S is equipped with an ATM adapter card.

2.  GTE  SPANet  Switch 

The  GTE  SPANet  switch  is  the  major  ATM  switching  equipment  at  RL.  The  configuration  of  this  switch 
is  as  follows  [4]: 

•  DS-3 Trunk Interface Module (DTIM-3) with three 45 Mbps ports. Each port can access the
   Ethernet LAN and/or FDDI LAN.

•  DS-1 Interface Module (DIM) with four 1.544 Mbps ports. Each port provides access to a T-1
   carrier or a TRI-TAC circuit switch through DTGI.

•  Low-Rate Trunk Interface Module (LTIM) with a variable-rate port. Each port can access
   TRI-TAC radios and switches, satellite, T-1 carrier, and "Generic Low-Rate" devices.

•  SONET Interface Module (SIM). The port provides access to the fiber-optic cable or the FDDI
   LAN at a transmission rate of 155 Mbps.

•  Workstation Interface Module (WIM). This module provides access to workstations at a
   maximum speed of 100 Mbps.

3.  Switch  Accessories 

The  other  accessories  used  in  conjunction  with  the  SPANet  switch  are  the  Device  Service  Units 
(DL3200),  CISCO  Router  AGS+,  and  ATM  cell  converters. 

4.2.2  Satellite Transmission and Communication Equipment

An ATM Link Enhancer (ALE) is recommended for this experiment. At high data rates, the ALE
implements selective interleaving, which significantly improves the performance of ATM traffic over
satellite links. The ALE is capable of bit interleaving of the ATM header, which improves the
performance of the header error correction, thereby providing an acceptable "cell discard probability" [5].
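
The benefit of header bit interleaving against burst errors can be illustrated with a generic block interleaver. The sketch below is not the ALE's actual scheme, which is not described in this report; it assumes that each row holds the 40 header bits of one cell and that bits are transmitted column by column, so that a burst no longer than the number of rows places at most one errored bit in any single header, which single-bit header error correction can then repair.

```python
def interleave(symbols, rows: int, cols: int):
    """Write row by row, read column by column (generic block interleaver)."""
    if len(symbols) != rows * cols:
        raise ValueError("input length must equal rows * cols")
    return [symbols[r * cols + c] for c in range(cols) for r in range(rows)]


def deinterleave(symbols, rows: int, cols: int):
    """Inverse of interleave(): restore the original row-by-row order."""
    if len(symbols) != rows * cols:
        raise ValueError("input length must equal rows * cols")
    out = [None] * (rows * cols)
    index = 0
    for c in range(cols):
        for r in range(rows):
            out[r * cols + c] = symbols[index]
            index += 1
    return out


if __name__ == "__main__":
    rows, cols = 8, 40                       # 8 headers of 40 bits each (assumed)
    positions = list(range(rows * cols))     # label each bit with its original index
    sent = interleave(positions, rows, cols)
    burst = sent[100:100 + rows]             # an 8-bit error burst on the link
    headers_hit = sorted(p // cols for p in burst)
    print("headers touched by the burst:", headers_hit)
```

Without interleaving, the same 8-bit burst would fall inside one or two headers and exceed what single-bit correction can handle.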

The modem and other terminal equipment will be provided by CRC, Canada. Meanwhile, we will
assume the equipment configuration shown in Figure 3 for the Canadian terminals. The modem is
assumed to be an EFData SDM-450, which can handle up to 45 Mbps. The satellite
transmission equipment is assumed to be standard EHF satellite equipment.

4.2.3  Test Equipment

The HP 75000 Broadband Analyzer is identified as the ideal test equipment and is recommended for
this experiment. This piece of equipment is capable of generating ATM cells and monitoring the
performance of the switch as well as the satellite channel. Various data on the ATM cells, the switch and
the link can be collected with this equipment. This equipment is also vital for link characterization.

4.2.4  Test Procedure
Subtest I

1.  Set up the equipment as shown in Figure 3.

2.  Align the earth station antennas at both test nodes to the ACTS and set the frequencies and
    power levels as specified in the satellite access approval.

3.  Initialize the test equipment (HP 75000) and the ATM cell generating/switching equipment in
    accordance with the instructions contained in the operations manual or as directed by the
    network branch.

4.  Observe  and  record  the  status  of  all  the  equipment  at  time  t=0  (initial  point).  Calibrate  the 
equipment  if  necessary. 

5.  GTE  SPANet  switch  setup  instructions: 

•  Please  consult  the  network  branch  for  setup  and  operation  instructions. 

6.  Use the HP 75000 to generate and send ATM sample data for subtest I. To do this, follow
    the procedure specified below. Select "OK" or "Close Window" at the end of each selection.

•  Open  the  Test  Session  Manager  and  configure  the  system  as  follows: 

Set  Adaptation:  AAL-1 

All  Others:  Default  values 

•  Configure  the  test  procedure 

Set  the  active  port  by  activating  the  DS-3  interface. 
•  Open the Configure Interface window and set the following parameters:

   Select the type of connection such that signals travel to and from the system under test
   (SUT) via the transmitter and receiver independently.

   Framing Format:              C-bit
   Interface:                   UNI
   Cell Scrambler/Descrambler:  OFF
   Line Scrambler/Descrambler:  OFF
   Header Correction:           ON
   All Others:                  Default values


•  Cell Generation

•  Open the PDU sequence builder under the Builders window.

   Under this window, select Create and give the PDU name as "Test_Phase1".
   Select Edit if necessary to edit your selection.

•  Open  the  LIF  sequence  builder  and  create  sequence  from  the  ATM  cells. 

•  Open  the  Foreground  Cell  Generation  window. 

   Open the Set Content window (under Cell Access) and set the following:

   VPI:                 1
   VCI:                 1
   PT:                  0
   Cell Loss Priority:  1
   Defined Sequence:    Test_Phase1
   Sequence Length:     1500*
   AAL Type:            1
   All Others:          Default values

   * This equipment can only generate a maximum of 1500 cells per second in the foreground condition
   and 16 cells per second in the background condition. This is not enough for 45 Mbps data. At this
   rate, the required number of cells is 106,132.08 cells per second. This is a problem that is still to be
   resolved. Filling the pipe with the HP 75000 is not possible.

•  Open the ATM Cell Selector under the Receiver window and set the following:

   VPI:            1
   VCI:            1
   PT:             0
   SN Correction:  OFF
   AAL Type:       1
   All Others:     Default values


•  Activate the monitor and capture the cell stream in the Capture Memory window.

•  Open  the  Foreground  Channel  Generator  and  activate  the  transmit  button. 

Select  the  display  icon  to  view  the  captured  PDUs.  Store  these  PDUs  for  comparison 
with  the  received  PDUs. 

7.  Data  Collection 

•  Use the appropriate data sheets for data collection. Please indicate and record all
   observations. Use extra sheets if necessary.


Subtest  II 

To demonstrate the extent to which ATM can be transmitted over ACTS, high-speed file transfer, VTC,
or both should be used to test the usefulness of ATM. For this demonstration, set up the experiment as
shown in Figure 4.


WS#1, WS#2 -- ATM Switching Equipment -- Satellite Transmission Equipment
                                                      |
                                               (satellite link)
                                                      |
WS#3, WS#4 -- ATM Switching Equipment -- Satellite Transmission Equipment

Figure 4: Application Experiment Setup


Workstations #2 and #3 should run a point-to-point VTC between RL and CRC. Setup and operation of
the VTC is the responsibility of the Network Branch. Consult the network group for direction on the
VTC setup. Workstations #1 and #4 should be pre-loaded with a data-intensive file (any file or
combination of files of at least 45 megabytes). These files will be transferred between the two nodes.
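
A quick estimate shows why a file of at least 45 megabytes is a reasonable load for this link. The sketch below gives a lower bound on the transfer time when only the ATM cell header overhead is counted. It assumes a megabyte of 10^6 bytes and that the full nominal line rate is available to cells; adaptation-layer overhead, DS-3 framing, higher-layer protocol behavior and satellite propagation delay are all ignored, so real transfers will take longer.

```python
CELL_OCTETS = 53      # total cell size
PAYLOAD_OCTETS = 48   # information field


def min_transfer_time_s(file_bytes: int, line_rate_bps: float) -> float:
    """Lower bound on transfer time, counting only ATM header overhead."""
    payload_rate_bps = line_rate_bps * PAYLOAD_OCTETS / CELL_OCTETS
    return file_bytes * 8 / payload_rate_bps


if __name__ == "__main__":
    for label, rate in (("DS-3", 45e6), ("OC-3", 155e6)):
        seconds = min_transfer_time_s(45_000_000, rate)
        print(f"45 MB over {label}: at least {seconds:.2f} s")
```

A file of this size therefore occupies the DS-3 link for several seconds, long enough to overlap with the simultaneous VTC session.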

4.2.5  Phase I Equipment


RL Node:

Description                                   On-Site Availability   Obtainable
Sun Sparc 10 Workstation                      Yes
ATM Computer Interface @ 45 Mbps              Yes
GTE SPANet Switch & Accessories               Yes
Video Teleconference Components               Yes
  - Camera with accessories
  - Tripod Stand
  - Speakers
  - Amplifier and Microphone
ATM Link Enhancer                             No                     Yes (COMSAT)
Ka-band antenna with up/down converters,      No                     Yes (CRC)
  feed systems and accessories
EFData modem, SDM-450                         No                     Yes (COMSAT)
EFData Buffer EB-450                          No                     Yes (COMSAT)
HP 75000 Broadband Analyzer                   No                     Yes (Mitre)
Firebird Communication Analyzer MC6000        Yes

CRC Node:

Description                                   On-Site Availability   Obtainable
Sun Sparc 10 Workstation                      Yes?
ATM Computer Interface @ 45 Mbps              Yes?
GTE SPANet Switch & Accessories               Yes?
Video Teleconference Components               Yes?
  - Camera with accessories
  - Tripod Stand
  - Speakers
  - Amplifier and Microphone
ATM Link Enhancer                             No?                    Yes (COMSAT)
Ka-band antenna with up/down converters,      Yes
  feed systems and accessories
EFData modem, SDM-450                         No?
EFData Buffer EB-450                          No?
HP 75000 Broadband Analyzer                   ?                      Yes (Mitre) ?
Firebird Communication Analyzer MC6000        ?                      Yes (RL) ?

Note: ? is used to indicate unanswered questions regarding the equipment.

4.3  Phase II Experiment

Figure 5 shows the setup of the test configuration for the phase II experiment. As previously
indicated, the experiment will be conducted between RL and JITC.

4.3.1  ATM Switching Equipment

As the block diagram indicates, both RL and JITC already have fully functional ATM switching
equipment. Both sites have the GTE SPANet switch equipped with a SONET Interface Module (SIM)
operating at OC-3 rates. This switch has been described in section 4.2.1.

4.3.2  Satellite Transmission and Communication Equipment

The terminals for this phase of the experiment will be provided by NASA. The terminal is the 3.4 m
Ka-band transmit/receive HDR transportable terminal system assembled by BBN Systems and
Technologies. These terminals will transmit and receive ATM signals at OC-3 rates via the ACTS
between RL and JITC.

At the recent ACTS HDR experimenters meeting, held in Cambridge, Massachusetts, at the BBN
Systems and Technologies complex on July 6-7, 1994, the issues concerning the HDR experiment were
outlined. These issues include the HDR earth station site requirements, HDR safety handling
procedures, HDR terminal distribution and allocation, ACTS time allotment, and the availability and
acquisition of the HDR terminals. NASA, in conjunction with BBN Systems and Technologies, will provide
all the necessary components and software to run the terminal. NASA will also provide transportation
of the terminals to and from the sites. However, the sites (RL and JITC) will each be responsible for
terminal site preparation, approval, and the operation of the terminal after it is installed.

4.3.3  Test Equipment

Since information on the connectivity of the terminal is not available at this time, we will assume that
the HP 75000 and Firebird MC6000 will be compatible with the HDR terminal system.

4.3.4  Test  Procedure 

To stipulate the test procedure, we will assume that no extraordinary procedure is required for HDR
operation. If this is true, the test procedure will be identical to the procedure outlined in section 4.2.4 for
the phase I experiment. The only difference is that this experiment will be performed at 155 Mbps as
opposed to 45 Mbps. In other words, the terminals will be equipped with ATM/SONET user interface
modules. Through these interfaces, this experiment will be performed via the ACTS.

In this phase of the experiment, we also encounter the problem of generating enough data for the OC-3
transmission rate. At this rate, we need to generate 365,566.04 cells per second. The generation
and testing of enough cells to fill the pipe is the bottleneck of this experiment. To the best of our
knowledge, no equipment currently exists that can generate cells at this rate.
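The cell rate quoted above can be reproduced with a short calculation. The sketch below assumes the standard 53-byte ATM cell and, like the figure in the text, treats the full 155 Mbps as available for cells (SONET framing overhead is ignored). The function name `cells_per_second` is ours and purely illustrative.

```python
ATM_CELL_BYTES = 53  # standard ATM cell: 5-byte header + 48-byte payload


def cells_per_second(line_rate_bps: float) -> float:
    """Number of ATM cells needed per second to fill a link of the given bit rate."""
    return line_rate_bps / (ATM_CELL_BYTES * 8)


phase_two_rate = cells_per_second(155e6)  # Phase II rate used in the text
phase_one_rate = cells_per_second(45e6)   # Phase I rate, for comparison
print(f"{phase_two_rate:,.2f} cells/s at 155 Mbps")
print(f"{phase_one_rate:,.2f} cells/s at 45 Mbps")
```

Under these assumptions the Phase II experiment needs roughly 3.4 times as many cells per second as Phase I, which is why cell generation becomes the bottleneck.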
4.3.5 Phase II Equipment

RL/JITC Nodes:

Description                                  On-Site Availability   Obtainable
-------------------------------------------  ---------------------  -------------
Sun Sparc 10 Workstation                     Yes
ATM Computer Interface @ 155 Mbps            ?                      ?
GTE SPANet Switch & Accessories              Yes
Video Teleconference Components              Yes
  (camera with accessories, tripod stand,
  speakers, amplifier and microphone)
HDR Terminal with all the accessories        No                     Yes (NASA)
ATM Link Enhancer                            No                     Yes (COMSAT)
HP 75000 Broadband Analyzer                  No?                    Yes (Mitre)?
Firebird Communication Analyzer MC6000       Yes

5  SUMMARY 

In this paper, we have described a program plan for transmitting high data rate ATM/SONET data over
the ACTS. The plan is based on the information available during the summer research program
at the Rome Laboratory.

The execution of this plan is contingent upon several factors. The Phase I experiment depends on the
ability of CRC to upgrade their terminal to the ACTS specification. Phase II depends on the availability
of the HDR terminal from NASA.
Abstract

This report deals with the application of time-frequency distributions (TFDs) to direct
sequence spread spectrum (SS) communication systems. The case studied is that of a
jammer with time-varying characteristics. The capability of the newly devised TFDs to
properly localize single as well as multiple component signals in time and frequency
permits unbiased, low-variance estimation of the interference instantaneous frequency under
abrupt as well as evolutionary, rapidly changing conditions. This estimate is then used to
construct a finite impulse response filter which substantially reduces the interference power
with minimum possible distortion of the desired signal. This two-step mechanism for
interference excision can be viewed as a case of open loop adaptive filtering. However,
contrary to the existing techniques of closed loop self-tuning linear predictors or open loop
adaptive filtering based on fast Fourier transforms, the filter coefficients in the proposed
technique are obtained via time-varying spectral analysis. Closed form expressions for the
improvement of SNR at the receiver correlator output using TFD-based adaptive filtering
are derived. Results show that time-frequency representations of signals should be
considered an important tool for making SS systems immune to smart jamming.
Introduction

Spread spectrum (SS) systems are widely used in communications in a variety of
applications, including suppression of a strong interfering signal due to jamming or
multipath propagation, and low probability of intercept communications. A spread
spectrum system is characterized by the following: 1) The signal occupies a bandwidth much
in excess of the minimum bandwidth necessary to send the information. 2) Spreading is
accomplished by means of a spreading signal, often called a code signal, which is independent
of the data. 3) At the receiver, despreading to recover the original data is accomplished by
correlating the received spread signal with a synchronized replica of the spreading signal
used to spread the information.

The most commonly used type of SS is direct sequence (DS), in which
modulation is achieved by superimposing a pseudo-noise (PN) sequence upon the data
bits. In the receiver, cross-correlation with a replica of the PN sequence transfers the
information signal back to its original bandwidth, while reducing the level of a narrow-band
interference by spreading it across the bandwidth occupied by the PN sequence.
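The spreading and despreading operations described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the system studied in this report: the PN code is replaced by a seeded random ±1 sequence, the chip count `L = 63` and the jammer parameters are arbitrary, and there is no noise and no excision filter. The helper names `spread` and `despread` are hypothetical.

```python
import math
import random


def spread(bits, pn):
    """Replace every data bit (+1/-1) by the bit times the whole PN chip sequence."""
    return [b * c for b in bits for c in pn]


def despread(chips, pn):
    """Correlate each bit interval with the PN replica and decide on the sign."""
    chips_per_bit = len(pn)
    decisions = []
    for start in range(0, len(chips), chips_per_bit):
        u = sum(x * c for x, c in zip(chips[start:start + chips_per_bit], pn))
        decisions.append(1 if u >= 0 else -1)
    return decisions


rng = random.Random(0)
L = 63                                         # chips per bit (illustrative value)
pn = [rng.choice((-1, 1)) for _ in range(L)]   # stand-in for a real PN code
bits = [rng.choice((-1, 1)) for _ in range(20)]
tx = spread(bits, pn)

# Narrow-band interference: a sinusoid whose peak exceeds the chip amplitude.
rx = [s + 1.4 * math.cos(0.3 * n) for n, s in enumerate(tx)]

# Deciding chip by chip fails often; correlating over a whole bit does not.
chip_errors = sum((1 if r >= 0 else -1) != s for r, s in zip(rx, tx))
bit_errors = sum(d != b for d, b in zip(despread(rx, pn), bits))
print(chip_errors, bit_errors)
```

The correlator sums L chips coherently for the signal, while the sinusoid is multiplied by the PN signs and largely averages out. This is the processing gain that an excision filter placed before the correlator aims to improve further.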

The  performance  of  a  PN  SS  system  can  be  further  improved,  with  respect  to  its 
immunity  to  narrow-band  interference,  by  applying  an  excision  filter  prior  to  despreading. 
This  filter  suppresses  the  interference  and  thus  increases  the  signal  to  noise  ratio  (SNR)  at 
the  output  of  the  correlator.  Excision  can  be  performed  in  the  following  domains: 

1) Frequency domain: The fast Fourier transform (FFT) of the data over one information bit is
weighted by appropriate values and then transformed back to the time domain [1]. This is an
effective method for stationary narrow-band interference. Sidelobes may present a problem in
removing the interference without losing some of the signal energy.

2) Time domain: This includes adaptive linear predictors and smoothers (LMS, RLS)
[2,3]. Tracking is highly dependent on the SNR and often fails under rapidly time-varying
interference.

3) Wavelet domain: The discrete wavelet transform (DWT) is applied to the data and
the coefficients of high energy are removed prior to the inverse transform [4]. The DWT
is appropriate for cases of pulse jamming or interference with burst characteristics.

4) Time and frequency domains: Non-parametric spectral techniques are first
employed. Then, a time-domain transversal filter is designed from the spectral
information of the data [1]. Spectral estimation methods combined with open loop
adaptive filtering were shown to suffer from the same drawbacks as frequency-domain
techniques.


None of the above methods is capable of properly incorporating the time-varying
nature of the interference frequency characteristics. Fig. 1 shows that frequency and
wavelet excision, in essence, remove all desired signal information over the
frequency band ΔF and the time duration ΔT, respectively. As such, we maintain that in the
case of a chirp, as well as other time-varying interfering signals, frequency-domain methods
ignore the fact that only a few frequency bins are contaminated by the jammer at a given time.
Wavelet-domain excision techniques, on the other hand, do not account for the fact that
only a few time samples may be contaminated by the jammer at a given frequency. Applying
either method will eliminate the interference, but will unnecessarily reduce the desired
signal energy.

[Fig. 1: Time-frequency plane (frequency versus time) indicating the regions removed by
time-domain excision and by frequency-domain excision.]
Contributions

In this report, time-frequency (t-f) distributions are employed for performing
time-varying spectral analysis on the received signal (see Fig. 2). On the basis of the
instantaneous frequency estimate, three- and five-coefficient linear phase open loop
adaptive filters can be formed to annihilate the time-varying interference with minimum
possible loss of signal energy.

The objective of the proposed technique is to highly attenuate the received signal in
those time-frequency regions which contain strong interference, as depicted by the region
between the dashed lines in Fig. 1. The effectiveness of the t-f based interference
suppression algorithm is assessed by evaluating the performance of the receiver with and
without the interference suppression filter. Analysis and simulations clearly show a
reduction in the probability of error using t-f techniques over time-only or frequency-only
methods.

Time-Frequency  Distributions 

Time-frequency  distributions  (TFDs)  have  been  shown  to  be  a  powerful  tool  in 
signal  analysis  and  processing.  They  properly  depict  the  time-varying  characteristics  of 
nonstationary  signals.  TFDs  allow  separation  between  signals  overlapping  in  time  and  in 
frequency,  which  could  not  be  done  using  windowing  or  filtering  techniques.  TFDs  are 
characterized  by  a  two  dimensional  function.  This  function  is  referred  to  as  a  "kernel"  and 
can  be  designed  such  that  the  corresponding  TFD  satisfies  several  desired  properties.  For  a 
full  discussion  of  the  TFDs  and  kernel  design  methods,  we  refer  the  reader  to  references 
[5,6]. Among the desired t-f properties is the capability to satisfy the instantaneous
frequency condition. Generally, this property allows the TFD to exhibit peaks at the
derivative of the phase of each signal component, irrespective of their time-varying nature.
In this report, we use TFD methods to localize a rapidly time-varying interferer in
the time-frequency domain and excise it, yielding a higher SNR at the receiver output than
that obtained using other excision methods. Below, we present the general class of TFDs
defined by Cohen [5] and discuss its relation with both the Wigner distribution (WD) [7]
and the ambiguity function. The role of the kernel in uniquely characterizing the TFD is
highlighted. Let $R_f(t,\tau)$ be the instantaneous autocorrelation of the complex signal $f(t)$,

$$R_f(t,\tau) = f(t+\tau/2)\, f^*(t-\tau/2) \tag{1}$$

then

$$W_f(t,\omega) = \int R_f(t,\tau)\, e^{-j\omega\tau}\, d\tau \tag{2}$$

and the ambiguity function is

$$A_f(\theta,\tau) = \frac{1}{2\pi}\int R_f(t,\tau)\, e^{j\theta t}\, dt \tag{3}$$

with

$$W_f(t,\omega) = \iint A_f(\theta,\tau)\, e^{-j(\theta t+\omega\tau)}\, d\theta\, d\tau \tag{4}$$

This  establishes  the  relationship  between  the  ambiguity  function  and  Wigner  distribution. 
Smoothing  in  both  the  time  and  frequency  directions  is  often  carried  out. 

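$$W'_f(t,\omega) = \frac{1}{4\pi^2}\iint W_f(v,\zeta)\,\Phi(t-v,\omega-\zeta)\, dv\, d\zeta \tag{5}$$

where $\Phi(t,\omega)$ is the time-frequency domain kernel, i.e., the 2-D Fourier transform of the
ambiguity domain kernel $\phi(\theta,\tau)$ taken in the sense of (4). By the 2-D convolution theorem,
we have

$$W'_f(t,\omega) = \iint A_f(\theta,\tau)\,\phi(\theta,\tau)\, e^{-j(\theta t+\omega\tau)}\, d\theta\, d\tau = C_f(t,\omega;\phi) \tag{6}$$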

which is the Cohen class of time-frequency distributions [5,6]. $C_f(t,\omega;\phi)$ can also be
expressed as the Fourier transform of the generalized autocorrelation function

$$C_f(t,\omega;\phi) = \int R'_f(t,\tau)\, e^{-j\omega\tau}\, d\tau \tag{7}$$

where

$$R'_f(t,\tau) = \frac{1}{2\pi}\int R_f(u,\tau)\,\psi(t-u,\tau)\, du$$

The autocorrelation domain kernel $\psi(t,\tau)$ is defined as

$$\psi(t,\tau) = \int \phi(\theta,\tau)\, e^{-j\theta t}\, d\theta$$
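As a check on the step from the ambiguity-domain form (6) to the autocorrelation-domain form (7), the substitution can be written out explicitly. The sketch below assumes the normalization $1/2\pi$ in both the ambiguity function (3) and the generalized autocorrelation, and writes $\phi(\theta,\tau)$ for the ambiguity domain kernel and $\psi(t,\tau)$ for the autocorrelation domain kernel. These symbol choices are our reading of the source.

```latex
\begin{align*}
R'_f(t,\tau)
  &= \frac{1}{2\pi}\int R_f(u,\tau)
     \left[\int \phi(\theta,\tau)\, e^{-j\theta (t-u)}\, d\theta\right] du \\
  &= \int \phi(\theta,\tau)\, e^{-j\theta t}
     \left[\frac{1}{2\pi}\int R_f(u,\tau)\, e^{j\theta u}\, du\right] d\theta \\
  &= \int \phi(\theta,\tau)\, A_f(\theta,\tau)\, e^{-j\theta t}\, d\theta ,
\end{align*}
so that
\begin{equation*}
C_f(t,\omega;\phi) = \int R'_f(t,\tau)\, e^{-j\omega\tau}\, d\tau
  = \iint A_f(\theta,\tau)\,\phi(\theta,\tau)\,
    e^{-j(\theta t+\omega\tau)}\, d\theta\, d\tau .
\end{equation*}
```

With $\phi(\theta,\tau) = 1$ the kernel $\psi$ reduces to $2\pi\,\delta(t)$, the generalized autocorrelation reduces to $R_f(t,\tau)$, and the distribution reduces to the Wigner distribution (2).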

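The generalized class of t-f distributions $C_f(t,\omega;\phi)$ can also be obtained from the inverse
Fourier transform (IFT) of the generalized spectral correlation $R'_F$,

$$C_f(t,\omega;\phi) = \frac{1}{2\pi}\int R'_F(\omega,\theta)\, e^{j\theta t}\, d\theta, \qquad
R'_F(\omega,\theta) = \frac{1}{2\pi}\int R_F(\zeta,\theta)\,\Psi(\omega-\zeta,\theta)\, d\zeta \tag{8}$$

with $R_F$ and $\Psi$ representing the instantaneous spectral correlation and the spectral
correlation domain kernel

$$R_F(\omega,\theta) = F(\omega+\theta/2)\, F^*(\omega-\theta/2), \qquad
\Psi(\omega,\theta) = \int \phi(\theta,\tau)\, e^{j\omega\tau}\, d\tau$$

respectively. The following relations follow from equations (1)-(8):

$$W_F(\omega,t) = \int R_F(\omega,\theta)\, e^{-j\theta t}\, d\theta = 2\pi\, W_f(-t,\omega)$$

$$A_F(\tau,\theta) = \frac{1}{2\pi}\int R_F(\omega,\theta)\, e^{j\omega\tau}\, d\omega = 2\pi\, A_f(-\tau,\theta)$$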

Thus the generalized time-frequency distribution can be viewed in four ways: 1) the 2-D
convolution of the WD with the time-frequency domain kernel; 2) the 2-D FT of the
product of the ambiguity function and the kernel; 3) the 1-D FT (with respect to the
lag variable) of the convolution (in the time direction) of the instantaneous
autocorrelation with the temporal correlation domain kernel; 4) the 1-D IFT (with respect to
the Doppler shift variable) of the convolution (in the frequency direction) of the
instantaneous spectral autocorrelation with the spectral correlation domain kernel.

Table 1 lists the desired properties which any devised t-f distribution should
satisfy. Each property translates into a condition on the employed kernel. All conditions are
compatible and can be satisfied simultaneously. In this report, we emphasize property 6 (P6).
The ability to estimate the instantaneous frequency is the key to using the t-f distribution in
SS communication problems. Two commonly used kernels, namely the binomial and
Born-Jordan kernels, which satisfy all constraints, are shown in Fig. 3 in the ambiguity domain.
The unit values along the center axes are necessary to satisfy the time and frequency marginal
constraints. The low-pass filter characteristics lead to reduced cross-terms, which are
positioned away from the origin. Cross-terms are undesired components produced by
the bilinear dependence of the distributions on the data, evident in the above equations.

Figure 4 illustrates the time and time-frequency plots of a burst of a single sinusoid
with SNR = 20 dB. The time-frequency distribution was generated using the binomial kernel.
The mesh and contour plots clearly show the starting and ending times of the sinusoid.
Fig. 5 depicts the case of a chirp signal in noise. The chirp signal exists over the
period [200-264]. In addition to the signal duration, the WD in Fig. 5 shows a peak
positioned at the instantaneous frequency of the chirp, irrespective of its rate of variation.
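The peak-tracking behaviour described above can be illustrated with a small numerical sketch. It is not the binomial-kernel TFD used for the figures. It assumes a noise-free complex linear chirp and a plain windowed (pseudo) Wigner distribution evaluated at a single time index. The function names, window length and frequency grid are illustrative choices of ours.

```python
import cmath
import math


def chirp(n_samples, f0, rate):
    """Complex linear chirp with instantaneous frequency f0 + rate*n (cycles/sample)."""
    return [cmath.exp(2j * math.pi * (f0 * n + 0.5 * rate * n * n))
            for n in range(n_samples)]


def wigner_slice(x, n, half_window, n_freqs):
    """Pseudo-Wigner distribution of x at time index n.

    Values are returned on the grid f_k = k / (2 * n_freqs), k = 0..n_freqs-1,
    which covers [0, 0.5) cycles/sample.
    """
    m = min(half_window, n, len(x) - 1 - n)        # keep n - t and n + t in range
    lags = range(-m, m + 1)
    r = [x[n + t] * x[n - t].conjugate() for t in lags]  # instantaneous autocorrelation
    values = []
    for k in range(n_freqs):
        f = k / (2.0 * n_freqs)
        # the two samples are 2*t apart, hence the factor 4*pi instead of 2*pi
        w = sum(rv * cmath.exp(-4j * math.pi * f * t) for rv, t in zip(r, lags))
        values.append(w.real)
    return values


def instantaneous_frequency(x, n, half_window=32, n_freqs=256):
    """Estimate the instantaneous frequency at time n as the location of the WD peak."""
    w = wigner_slice(x, n, half_window, n_freqs)
    k_peak = max(range(n_freqs), key=lambda k: w[k])
    return k_peak / (2.0 * n_freqs)


x = chirp(256, f0=0.05, rate=0.001)
for n in (100, 200):
    print(n, instantaneous_frequency(x, n), 0.05 + 0.001 * n)
```

For a linear chirp the product x(n+t)x*(n-t) is a pure complex exponential in the lag t, so the peak sits at the instantaneous frequency whatever the chirp rate. With noise or several components, a smoothing kernel such as those of Fig. 3 would be applied, which this sketch does not attempt.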
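Table 1. Distribution properties and associated kernel requirements.

P0. nonnegativity: $C_f(t,\omega;\phi) \ge 0 \;\; \forall t,\omega$
Q0. $\phi(\theta,\tau)$ is the ambiguity function of some function $w(t)$.

P1. realness: $C_f(t,\omega;\phi) \in \mathbb{R}$
Q1. $\phi(\theta,\tau) = \phi^*(-\theta,-\tau)$

P2. time shift: $g(t) = f(t-t_0) \Rightarrow C_g(t,\omega;\phi) = C_f(t-t_0,\omega;\phi)$
Q2. $\phi(\theta,\tau)$ does not depend on $t$.

P3. frequency shift: $g(t) = f(t)\,e^{j\omega_0 t} \Rightarrow C_g(t,\omega;\phi) = C_f(t,\omega-\omega_0;\phi)$
Q3. $\phi(\theta,\tau)$ does not depend on $\omega$.

P4. time marginal: $\frac{1}{2\pi}\int C_f(t,\omega;\phi)\, d\omega = f(t)f^*(t)$
Q4. $\phi(\theta,0) = 1 \;\; \forall\theta$

P5. frequency marginal: $\int C_f(t,\omega;\phi)\, dt = F(\omega)F^*(\omega)$
Q5. $\phi(0,\tau) = 1 \;\; \forall\tau$

P6. instantaneous frequency: $\dfrac{\int \omega\, C_f(t,\omega;\phi)\, d\omega}{\int C_f(t,\omega;\phi)\, d\omega} = \omega_f(t)$
Q6. Q4 and $\left.\dfrac{\partial \phi(\theta,\tau)}{\partial \tau}\right|_{\tau=0} = 0 \;\; \forall\theta$

P7. group delay: $\dfrac{\int t\, C_f(t,\omega;\phi)\, dt}{\int C_f(t,\omega;\phi)\, dt} = t_f(\omega)$
Q7. Q5 and $\left.\dfrac{\partial \phi(\theta,\tau)}{\partial \theta}\right|_{\theta=0} = 0 \;\; \forall\tau$

P8. time support: $f(t) = 0$ for $|t| > t_c \Rightarrow C_f(t,\omega;\phi) = 0$ for $|t| > t_c$
Q8. $\psi(t,\tau) = \int \phi(\theta,\tau)\, e^{-j\theta t}\, d\theta = 0$ for $|\tau| < 2|t|$

P9. frequency support: $F(\omega) = 0$ for $|\omega| > \omega_c \Rightarrow C_f(t,\omega;\phi) = 0$ for $|\omega| > \omega_c$
Q9. $\int \phi(\theta,\tau)\, e^{j\omega\tau}\, d\tau = 0$ for $|\theta| < 2|\omega|$

P10. reduced interference
Q10. $\phi(\theta,\tau)$ is a 2-D low pass filter type.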
Fig. 3 Commonly used time-frequency kernels

Fig. 4 A single sinusoid in noise: (a) the signal, (b) the mesh plot, (c) the contour plot

Fig. 5 Contour plots of a chirp in noise: (a) low rate, (b) high rate


Analysis 

In the transmitted signal generation, the number of PN chips per information bit is
L, i.e.,

$$p_k(t) = \sum_{j=1}^{L} p_{kj}\, q(t - j\tau_c) \tag{9}$$

where $p_{kj}$ represents the output sequence from the PN code generator for the kth
information bit and $q(t)$ is a rectangular pulse of duration $\tau_c$ and unit energy. The total
transmitted signal may be expressed in the form

$$s(t) = \sum_{k} I_k\, p_k(t - kT_b) \tag{10}$$

where $\{I_k\}$ represents the binary information sequence and $T_b = L\tau_c$ is the bit interval
(reciprocal of the bit rate). The received signal has the form $r(t) = s(t) + i(t) + w(t)$, where
w(t) is the uncorrelated white noise process, s(t) is the desired signal, and i(t) is the
interference. In the following, we present three different cases of excision filters based on
TFDs. Under the assumption of a rapidly varying environment, the length of the
excision filter should be kept to a minimum. Since nulling a cosinusoidal function requires
a filter with at least three coefficients, we consider in Case 1 the application of a linear
phase 3-coefficient filter whose zero lies on the unit circle with phase equal to the
instantaneous frequency (obtained from the TFD). We maintain that a zero on the unit circle
implies perfect nulling. However, due to the problems cited in Case 1, two other cases are
considered, which we advocate for use in TFD-based excision SS systems.

Case 1: Three-coefficient excision filter

Let h(n) represent the impulse response of the linear phase excision filter with
coefficients $h(-1) = h(1) = 1$, $h(0) = -2\cos(\omega_n)$, where $\omega_n$ is the instantaneous
frequency of the time-varying interference obtained using the TFD. The filter input is the
received signal x(n) and the output is y(n),

$$y(n) = \sum_{m=-1}^{1} h(m)\, x(n-m)$$

By linearity, the output can be written as

$$y(n) = p_o(n) + i_o(n) + w_o(n)$$

where the subscript "o" denotes output. The output signal is fed into the PN correlator,
yielding the decision variable U.
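The nulling property of the three-coefficient filter is easy to confirm numerically. The sketch below assumes, for simplicity, a jammer of constant frequency, so a single value of omega is used for every sample; in the scheme described here the frequency would be re-estimated from the TFD at each time sample. The function name `excision_filter` and the test values are illustrative.

```python
import math


def excision_filter(x, omega):
    """Apply h = [1, -2cos(omega), 1] (taps at m = -1, 0, 1); edge samples are dropped."""
    centre_tap = -2.0 * math.cos(omega)
    return [x[n + 1] + centre_tap * x[n] + x[n - 1] for n in range(1, len(x) - 1)]


omega0 = 0.7                                   # jammer frequency, radians/sample
jammer = [5.0 * math.cos(omega0 * n + 0.3) for n in range(64)]

matched = excision_filter(jammer, omega0)      # zero placed exactly on the jammer
mismatched = excision_filter(jammer, 1.2)      # zero placed at the wrong frequency

print(max(abs(v) for v in matched))            # numerically zero
print(max(abs(v) for v in mismatched))         # clearly non-zero
```

The cancellation follows from cos(a + w) + cos(a - w) = 2 cos(a) cos(w): the two outer taps reproduce exactly what the centre tap subtracts, for any amplitude and phase of the sinusoid.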
$$U = \sum_{n=1}^{L} p_o(n)\,p(n) + \sum_{n=1}^{L} i_o(n)\,p(n) + \sum_{n=1}^{L} w_o(n)\,p(n)$$

It is assumed that the interference is eliminated by filtering. Then,

$$U = \sum_{n=1}^{L} p_o(n)\,p(n) + \sum_{n=1}^{L} w_o(n)\,p(n) = U_1 + U_2$$

The first term can be detailed as

$$U_1 = \sum_{n=1}^{L}\sum_{m=-1}^{1} h(m)\,p(n-m)\,p(n)
     = \sum_{n=1}^{L}\big(p(n-1) + p(n+1) - 2\cos(\omega_n)\,p(n)\big)\,p(n)$$

$$\quad\;\; = \sum_{n=1}^{L} p(n)\,p(n-1) + \sum_{n=1}^{L} p(n)\,p(n+1) - 2\sum_{n=1}^{L}\cos(\omega_n)\,p^2(n)$$

The following assumptions are invoked to simplify the analysis and gain preliminary insights
into the performance of the proposed method under rapidly time-varying environments.

1) The PN sequence consists of identically distributed random variables such that p(n) = 1 or -1
with equal probability.

2) The instantaneous radian frequency is a random variable uniformly distributed over
[0, 2π].


With the above two assumptions, the expected value and mean square value of U can be
determined as follows:

$$E[U_1] = \sum_{n=1}^{L} E[p(n)\,p(n-1)] + \sum_{n=1}^{L} E[p(n)\,p(n+1)] - 2\sum_{n=1}^{L} E[\cos(\omega_n)] \tag{11}$$

But since

$$E[\cos(\omega_n)] = \frac{1}{2\pi}\int_0^{2\pi}\cos(\omega_n)\, d\omega_n = 0, \tag{12}$$

then

$$E[U_1] = 0.$$

Further, it can easily be shown that $E[U_2] = 0$. The SNR of the correlator is defined as
the ratio of the square of the mean to the variance of the correlator output value,

$$\mathrm{SNR}_o = \frac{E^2[U]}{\mathrm{var}[U]} \tag{13}$$

Accordingly,

$$E[U] = E[U_1 + U_2] = 0 \tag{14}$$

Equation (14) implies that under the above two assumptions, the SNR becomes $-\infty$ dB, which
is highly undesirable.

Case 2: Five-coefficient excision filter

Next we try a five-coefficient linear phase FIR filter with

$$h(n) = [\,1,\;\; -4\cos(\omega_n),\;\; 2 + 4\cos^2(\omega_n),\;\; -4\cos(\omega_n),\;\; 1\,] \tag{15}$$

which is obtained by convolving the impulse response of the 3-coefficient filter in Case 1
with itself. It is clear that the middle term in the impulse response is no longer a
cosinusoidal function, but rather includes the squared value of the cosinusoid. As a result,
the expected value of the correlator output will no longer be zero, preventing the SNR from
being negative infinity, as in Case 1. Using the new filter,

$$U_1 = \sum_{n=1}^{L}\big[\,p(n-2) - 4p(n-1)\cos(\omega_n) + (2 + 4\cos^2(\omega_n))\,p(n)
      - 4p(n+1)\cos(\omega_n) + p(n+2)\,\big]\,p(n) \tag{16}$$

The expected values of the first, second, fourth and fifth terms in the above summation are
all zero, due to the uncorrelatedness of the PN sequence and the instantaneous frequency as
well as the zero mean property of both the PN sequence and the cosinusoidal function. The
expected value of the middle term is given by

$$E[U_1] = \sum_{n=1}^{L} E[2 + 4\cos^2(\omega_n)] = 2L + \frac{4L}{2\pi}\int_0^{2\pi}\cos^2(\omega_n)\, d\omega_n = 4L \tag{17}$$

Because the noise is uncorrelated with the PN sequence, $E[U_2] = 0$. Therefore the
numerator in equation (13) is given by

$$E^2[U] = E^2[U_1] = 16L^2 \tag{18}$$
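The two means derived so far, zero for the three-coefficient filter and 4L for the five-coefficient filter, can be checked by simulation under the same two assumptions (random ±1 chips, and an instantaneous frequency drawn independently and uniformly over [0, 2π] at every sample). The sketch below is only such a check. The function names, the seed, L = 32 and the number of trials are arbitrary choices of ours, and the noise and interference terms are left out.

```python
import math
import random


def convolve(a, b):
    """Full linear convolution of two short coefficient lists."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out


def three_tap(omega):
    return [1.0, -2.0 * math.cos(omega), 1.0]


def five_tap(omega):
    c = math.cos(omega)
    return [1.0, -4.0 * c, 2.0 + 4.0 * c * c, -4.0 * c, 1.0]


def mean_signal_term(taps_fn, half_len, L, trials, rng):
    """Monte Carlo estimate of E[U1], with U1 = sum_n p(n) * sum_m h_n(m) p(n - m)."""
    total = 0.0
    for _ in range(trials):
        # extra chips on both sides so that p(n - m) exists at the block edges
        p = [rng.choice((-1, 1)) for _ in range(L + 2 * half_len)]
        u1 = 0.0
        for n in range(half_len, half_len + L):
            h = taps_fn(rng.uniform(0.0, 2.0 * math.pi))  # new omega_n at every sample
            u1 += p[n] * sum(h[m + half_len] * p[n - m]
                             for m in range(-half_len, half_len + 1))
        total += u1
    return total / trials


rng = random.Random(1)
L = 32
m3 = mean_signal_term(three_tap, 1, L, 2000, rng)   # Case 1: expected near 0
m5 = mean_signal_term(five_tap, 2, L, 2000, rng)    # Case 2: expected near 4L
print(m3, m5, 4 * L)
```

The helper `convolve` also confirms the statement that the five coefficients of (15) are the three-coefficient response convolved with itself, for any fixed omega.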

The variance of U can be computed from the mean square value of $U_1$,

$$\begin{aligned}
E[U_1^2] ={}& \sum_{k=1}^{L}\sum_{n=1}^{L} E\{p(n)p(n-2)p(k)p(k-2)\}
            + \sum_{k=1}^{L}\sum_{n=1}^{L} E\{p(n)p(n+2)p(k)p(k+2)\} \\
          &+ 16\sum_{k=1}^{L}\sum_{n=1}^{L} E\{p(n)p(n-1)p(k)p(k-1)\cos(\omega_n)\cos(\omega_k)\} \\
          &+ 16\sum_{k=1}^{L}\sum_{n=1}^{L} E\{p(n)p(n+1)p(k)p(k+1)\cos(\omega_n)\cos(\omega_k)\} \\
          &+ \sum_{k=1}^{L}\sum_{n=1}^{L} E\{(2 + 4\cos^2(\omega_n))(2 + 4\cos^2(\omega_k))\} \\
          &+ \sum_{k=1}^{L}\sum_{n=1}^{L} E\{p(n)p(n-2)p(k)p(k+2)\}
            + \sum_{k=1}^{L}\sum_{n=1}^{L} E\{p(k)p(k-2)p(n)p(n+2)\}
\end{aligned} \tag{19}$$

The rest of the terms in the mean square value of $U_1$ represent the cross products of the
individual terms. These products are zero, due to the PN and the instantaneous frequency
characteristics.

The only non-zero terms in the first and second double summations of equation
(19) are those corresponding to n=k. These terms take unit values. As such, each of the first
two double summations is L. The third and fourth double summations each take the value 8L.
This is due to the fact that the only nonzero terms are those corresponding to n=k. Each of
these terms has an expected value of 1/2. In the last two double summations there are L-2
nonzero terms corresponding to n=k+2 and n=k-2, respectively. The fifth double
summation can be simplified to

$$\begin{aligned}
&\sum_{k=1}^{L}\sum_{n=1}^{L} E\{4 + 8\cos^2(\omega_n) + 8\cos^2(\omega_k) + 16\cos^2(\omega_n)\cos^2(\omega_k)\} \\
&\quad = 4L^2 + 4L^2 + 4L^2 + 16\sum_{n=1}^{L} E\{(0.5(1 + \cos(2\omega_n)))^2\}
        + 16\sum_{n=1}^{L}\sum_{k \ne n} E\{\cos^2(\omega_n)\}\,E\{\cos^2(\omega_k)\} \\
&\quad = 12L^2 + 4\sum_{n=1}^{L} E\{1 + 2\cos(2\omega_n) + \cos^2(2\omega_n)\} + 4(L^2 - L)
        = 12L^2 + 6L + 4L^2 - 4L \\
&\quad = 16L^2 + 2L
\end{aligned} \tag{20}$$

By adding the values of the different terms constituting $E[U_1^2]$, we obtain

$$E[U_1^2] = 16L^2 + 22L - 2 \tag{21}$$

Now we focus on the mean square value of the correlator output due to noise, which is
denoted by $U_2$,

$$U_2 = \sum_{n=1}^{L}\big[\,w(n-2) - 4w(n-1)\cos(\omega_n) + (2 + 4\cos^2(\omega_n))\,w(n)
      - 4w(n+1)\cos(\omega_n) + w(n+2)\,\big]\,p(n) \tag{22}$$

Applying the expectation operator to the square of the above equation, we obtain

$$\begin{aligned}
E[U_2^2] ={}& \sum_{k=1}^{L}\sum_{n=1}^{L} E\{p(n)w(n-2)p(k)w(k-2)\}
            + \sum_{k=1}^{L}\sum_{n=1}^{L} E\{p(n)w(n+2)p(k)w(k+2)\} \\
          &+ 16\sum_{k=1}^{L}\sum_{n=1}^{L} E\{p(n)w(n-1)p(k)w(k-1)\cos(\omega_n)\cos(\omega_k)\} \\
          &+ 16\sum_{k=1}^{L}\sum_{n=1}^{L} E\{p(n)w(n+1)p(k)w(k+1)\cos(\omega_n)\cos(\omega_k)\} \\
          &+ \sum_{k=1}^{L}\sum_{n=1}^{L} E\{(2 + 4\cos^2(\omega_n))(2 + 4\cos^2(\omega_k))\,w(n)p(n)w(k)p(k)\}
\end{aligned} \tag{23}$$

Each of the first two double summations yields $L\sigma^2$, where $\sigma^2$ denotes the noise variance.
The value of each of the third and fourth double summations is $8L\sigma^2$. The last double
summation is $14L\sigma^2$. All of the above values were obtained based on the assumption that the
PN sequence and the white noise sequence are uncorrelated, and that the noise is a white
stationary random process. Since

$$E^2[U_1] = 16L^2, \qquad E[U_2] = 0,$$

then

$$\mathrm{var}[U_1] = 22L - 2, \qquad \mathrm{var}[U_2] = 32L\sigma^2 \tag{24}$$

Substituting the above in (13), we obtain

$$\mathrm{SNR}_o = \frac{16L^2}{22L - 2 + 32L\sigma^2} \tag{25}$$

For a reasonably high value of L, the above expression can be simplified to

$$\mathrm{SNR}_o = \frac{L}{\frac{22}{16} + 2\sigma^2} \tag{26}$$

Case 3: Two three-coefficient excision filters

Next, we apply the same 3-coefficient filter of Case 1 to both the input sequence
and the receiver PN sequence. In this case, the correlator output is

$$U = \sum_{n=1}^{L} p_o(n)\,p_o(n) + \sum_{n=1}^{L} i_o(n)\,p_o(n) + \sum_{n=1}^{L} w_o(n)\,p_o(n) \tag{27}$$

Assuming that the interference is completely excised, the second term in the above
summation is zero. The mean value of the correlator output is

$$E[U] = E[U_1] + E[U_2] \tag{28}$$

where the first term is due to the signal and the second term is due to the noise. It is easily
shown that

$$E[U_1] = E\Big[\sum_{n=1}^{L}\big(p(n-1) + p(n+1) - 2\cos(\omega_n)p(n)\big)\big(p(n-1) + p(n+1) - 2\cos(\omega_n)p(n)\big)\Big] \tag{29}$$

$$E[U_2] = E\Big[\sum_{n=1}^{L}\big(w(n-1) + w(n+1) - 2\cos(\omega_n)w(n)\big)\big(p(n-1) + p(n+1) - 2\cos(\omega_n)p(n)\big)\Big] \tag{30}$$

Using the uncorrelatedness of the PN sequence and the noise sequence, the mean value of
the correlator output due to the signal is 4L, whereas the mean value due to the noise is zero.
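The two means stated for Case 3 can also be checked by simulation. The sketch below assumes random ±1 chips, zero-mean Gaussian noise samples of standard deviation `sigma`, and an independent, uniformly distributed instantaneous frequency at each sample; the Gaussian choice and all parameter values are ours, since the analysis only requires the noise to be white and uncorrelated with the PN sequence.

```python
import math
import random


def case3_means(L, trials, sigma, rng):
    """Estimate (E[U1], E[U2]) when the same 3-tap filter is applied both to the
    received chips and to the receiver PN replica."""
    sum_u1 = 0.0
    sum_u2 = 0.0
    for _ in range(trials):
        p = [rng.choice((-1, 1)) for _ in range(L + 2)]   # one extra chip at each edge
        w = [rng.gauss(0.0, sigma) for _ in range(L + 2)]
        for n in range(1, L + 1):
            c = 2.0 * math.cos(rng.uniform(0.0, 2.0 * math.pi))
            p_o = p[n - 1] + p[n + 1] - c * p[n]          # filtered PN sample
            w_o = w[n - 1] + w[n + 1] - c * w[n]          # filtered noise sample
            sum_u1 += p_o * p_o                           # signal term of (27)
            sum_u2 += w_o * p_o                           # noise term of (27)
    return sum_u1 / trials, sum_u2 / trials


L = 32
u1_mean, u2_mean = case3_means(L, 2000, 1.0, random.Random(2))
print(u1_mean, 4 * L, u2_mean)
```

Each filtered PN sample has mean square 1 + 1 + 4·E[cos²(ω)] = 4, which is where the value 4L comes from.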


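The mean square value of the signal term is

$$\begin{aligned}
E[U_1^2] ={}& \sum_{k=1}^{L}\sum_{n=1}^{L} 4
            + 4\sum_{k=1}^{L}\sum_{n=1}^{L} E\{p(n-1)p(n+1)p(k-1)p(k+1)\} \\
          &+ 16\sum_{k=1}^{L}\sum_{n=1}^{L} E\{p(n)p(n+1)p(k)p(k+1)\cos(\omega_n)\cos(\omega_k)\} \\
          &+ 16\sum_{k=1}^{L}\sum_{n=1}^{L} E\{p(n)p(n-1)p(k)p(k-1)\cos(\omega_n)\cos(\omega_k)\} \\
          &+ 16\sum_{k=1}^{L}\sum_{n=1}^{L} E\{\cos^2(\omega_n)\cos^2(\omega_k)\}
            + 16\sum_{k=1}^{L}\sum_{n=1}^{L} E\{\cos^2(\omega_k)\} \\
        ={}& 4L^2 + 4L + 8L + 8L + 4L^2 + 2L + 8L^2 = 16L^2 + 22L
\end{aligned} \tag{31}$$

Similarly, it can be shown that

$$E[U_2^2] = 20L\sigma^2 \tag{32}$$

So,

$$\mathrm{SNR}_o = \frac{16L^2}{22L - 2 + 20L\sigma^2} \tag{33}$$

For a reasonably high value of L, the above expression can be approximated by

$$\mathrm{SNR}_o = \frac{L}{\frac{22}{16} + \frac{20}{16}\sigma^2} \tag{34}$$

Comparing the results in Case 1, Case 2, and Case 3, it is clear that: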
1) A three-coefficient filter should not be used alone under the stated assumptions.

2) If the three-coefficient filter is used, then a similar filter should be applied to the
receiver PN sequence.

3) Using two three-coefficient filters, applied to the received signal and the receiver PN
sequence, yields a higher SNR than using a five-coefficient filter applied to the received
signal. This is due to the fact that a longer filter encompasses more samples of the white
noise sequence and as such produces a higher noise power level at the correlator output.
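The third point can be made concrete by evaluating the two large-L expressions side by side. The sketch below reads (26) as L/(22/16 + 2σ²) and (34) as L/(22/16 + (20/16)σ²); the function names and the sample values of L and σ² are illustrative.

```python
import math


def snr_five_tap(L, sigma2):
    """Large-L approximation (26): one five-coefficient filter on the received signal."""
    return L / (22.0 / 16.0 + 2.0 * sigma2)


def snr_two_three_tap(L, sigma2):
    """Large-L approximation (34): three-coefficient filters on signal and PN replica."""
    return L / (22.0 / 16.0 + (20.0 / 16.0) * sigma2)


def to_db(x):
    return 10.0 * math.log10(x)


L = 128
for sigma2 in (0.1, 1.0, 10.0):
    gain_db = to_db(snr_two_three_tap(L, sigma2)) - to_db(snr_five_tap(L, sigma2))
    print(f"sigma^2 = {sigma2:5.1f}: Case 3 exceeds Case 2 by {gain_db:.2f} dB")
```

The advantage of Case 3 grows with the noise variance and approaches 10 log10(2/1.25), about 2 dB, when the noise term dominates the denominator. With no noise the two expressions coincide.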

Conclusions 

A combined time-frequency distribution and adaptive filtering approach for
interference excision is presented. The ability of the TFD to localize the time-varying
frequency is utilized to estimate the instantaneous frequency of the jammer. Based on this
estimate, an FIR filter is designed which places an infinitely deep null at the jammer
frequency at every time sample. Three different cases are considered. In the first two cases,
the received signal is processed by a three- and a five-coefficient filter, respectively. In the
third case, a three-coefficient filter is applied to both the received signal and the receiver PN
sequence. In each case, closed form expressions for the improvement of the SNR are derived.
The analytical results show that the third case yields the highest SNR.
DESIGNING SOFTWARE BY REFORMULATION USING KIDS

D. Paul Benjamin
Assistant Professor
Computer Science Department
Oklahoma State University

Abstract

In recent years, much attention has been paid to designing software by formal methods.
Such an approach has been shown to facilitate the design of correct software. The KIDS
system utilized at Rome Laboratory is an example of a software design system based entirely
on a formal approach - the theory of institutions developed by Joseph Goguen and his
coworkers. KIDS combines a theory of problem solving with a domain theory to derive a
high-level program, which can then be transformed to improve efficiency. Frequently, the
problem solving theory is a theory of global search, as it is in the KTS transportation
scheduling system. Unfortunately, although global search through the space of possible solutions
may be suitable for some problems, it is extremely inefficient for many others. In many
problems, the search space is far too large for such an approach. Rather, it is desirable to
decompose the space into independent components, so that partial solutions can be found in
each piece and combined to give a global solution. KIDS does support problem space
decomposition, but does not provide methods for finding useful decompositions for specific
problems. This paper describes how a theory of decomposition and reformulation can be used in
conjunction with KIDS to derive efficient programs.



Introduction 

In recent years, much attention has been paid to designing software by formal methods.
Such an approach has been shown to facilitate the design of correct software. KIDS (Kestrel
Institute Development System) is an example of a software design system based entirely on
a formal approach - the theory of institutions developed by Joseph Goguen and his
coworkers. Two of the research groups at Rome Labs, the KBSA (Knowledge-Based Software
Assistant) group and the KBP (Knowledge-Based Planning) group, use KIDS as the environment
within which to develop software.

The emphases of planning and software engineering differ in many ways; e.g., software
engineering in a formal environment is concerned with developing a program that is
provably correct with respect to a given specification, while planning is concerned more
generally with developing a program that has a high probability of producing a specified
behavior. However, despite the many differences in emphasis between planning and software
engineering, both possess extremely large spaces of possible designs. Thus, designers in
both fields are faced with the difficult task of finding efficient designs. A system such as
KIDS helps the designer to make clear specifications, to write correct programs, and to
refine existing code to improve its efficiency. However, no existing system helps the designer
find efficient designs in the first place.

KIDS combines a theory of problem solving with a domain theory to derive a high-level
program, which can then be transformed to improve efficiency. Frequently, the problem
solving theory is a theory of global search, as it is in the KTS transportation scheduling system.
Unfortunately, although global search through the space of possible solutions may be
suitable for some problems, it is extremely inefficient for many others. In many problems, the
search space is far too large for such an approach. Rather, it is desirable to decompose the
space into independent components, so that partial solutions can be found in each piece and
combined to give a global solution. KIDS does support problem space decomposition, but
does not provide methods for finding useful decompositions for specific problems. This paper
describes how a theory of decomposition and reformulation can be used in conjunction with
KIDS to derive efficient programs.

Example:  n-Queens 

The following brief review of KIDS is intended for the reader with some working knowledge
of the system. Other readers are urged to read Smith (1990) and related papers to
understand how KIDS works. This example is drawn from Smith (1990).

The  n-Queens  task  is  to  place  n  queens  on  an  nxn  chessboard  so  that  no  two  queens  are 
on  the  same  row,  column,  or  diagonal.  We  are  posed  the  task  of  enumerating  all  solutions  for 
a  given  value  of  n. 

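[Figure: a 4x4 chessboard for the 4-queens problem, with rows and columns numbered 1 to 4 and four queens placed.]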


The  KIDS  user  must  develop  a  theory  of  this  domain  that  specifies  the  legal  solutions.  A 
direct  way  to  do  this  is  to  specify  a  solution  as  satisfying  a  predicate  that  is  the  conjunction 
of the goal conditions: no-two-queens-on-same-row & no-two-queens-on-same-column &
no-two-queens-on-same-up-diagonal & no-two-queens-on-same-down-diagonal. In Smith
(1990), it is shown how KIDS combines this domain theory with a theory of global search to
derive  a  correct  high-level  program.  KIDS  provides  optimizations  that  the  user  can  apply  to 
this  program.  The  process  of  theory  combination  can  be  viewed  as  forming  the  pushout  of 
the  domain  theory  with  the  global  search  theory.  DRIO  stands  for  the  domain  and  range  of 
the  function  to  be  synthesized,  and  the  input  and  output  conditions  that  specify  legal  inputs 
and  outputs. 

[Diagram: the Domain Theory and the Global Search theory, which share DRIO, are combined by a pushout to give the Program.]

The goal conditions are used as filters to reduce the search space. However, even with
these filters, the search space grows exponentially with the size of the problem. For example,
in the 4-queens problem shown above, there is no solution if we place the first queen in
the upper left-hand corner or lower left-hand corner, but the system cannot know this until
it  has  searched  all  ways  of  placing  the  remaining  queens.  This  exponential  growth  in  the 
number  of  possibilities  is  the  well-known  Achilles’  heel  of  problem  solving,  as  it  precludes 
the  efficient  solution  of  very  large  problems.  Global  search  cannot  be  used  to  enumerate  the 
solutions  to  the  million-queens  problem. 
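
The role of the goal conditions as filters can be illustrated with a minimal Python sketch of global search. This is not the program that KIDS derives; the function name `queens` and the row-by-row encoding are assumptions made for illustration. Each partial placement is extended one row at a time, and a candidate column is discarded as soon as it violates a column or diagonal condition.

```python
def queens(n):
    """Enumerate all n-queens solutions by global search with filters.

    A partial solution is a list `cols`, where cols[r] is the column of the
    queen on row r, so the same-row condition holds by construction.
    """
    solutions = []

    def consistent(cols, c):
        # Would a queen at (row len(cols), column c) attack an earlier queen?
        r = len(cols)
        for r0, c0 in enumerate(cols):
            if c0 == c:              # same column
                return False
            if r0 - c0 == r - c:     # same down-diagonal
                return False
            if r0 + c0 == r + c:     # same up-diagonal
                return False
        return True

    def extend(cols):
        if len(cols) == n:
            solutions.append(tuple(cols))
            return
        for c in range(n):
            if consistent(cols, c):  # filter: prune before descending
                extend(cols + [c])

    extend([])
    return solutions


if __name__ == "__main__":
    four = queens(4)
    print(four)
    # No 4-queens solution places the first queen in a corner column.
    print(all(s[0] not in (0, 3) for s in four))
```

Even with the filters, the search only discovers that a corner placement is hopeless after exploring the placements of the remaining queens beneath it, which is the behavior described above.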

Effective  search  heuristics  are  known  for  finding  a  single  solution  to  a  problem  of  that 
magnitude,  but  they  cannot  efficiently  enumerate  all  solutions.  This  prevents  consideration 
of the space of solutions, and in particular prevents optimal problem solving. More seriously,
the heuristic approach requires engineering a new heuristic for each problem class.
This  does  not  even  partially  solve  the  designer’s  problem  of  finding  an  efficient  algorithm;  it 
merely  transforms  it  into  the  problem  of  finding  an  efficient  heuristic. 

What  the  designer  needs  is  a  method  for  finding  good  decompositions  of  the  problem.  A 
good  decomposition  of  the  problem  has  at  least  the  following  two  properties:  it  splits  the 
problem into components such that solutions within the components compose to form solutions
to the whole problem, and it applies to all problems in the class.

KIDS  does  support  decomposition,  with  its  split  and  combine  operations.  However,  the 
only  two  forms  of  decomposition  implemented  are  first-rest  decomposition  and  half-half 
decomposition. In first-rest decomposition, the first part of the problem description is solved
and its solution is combined with the solution of the rest (which is itself obtained by a recursive call
to first-rest decomposition). In half-half decomposition, the problem description is split
roughly in half, and solutions of each half are combined (again leading to recursive calls).
Mergesort  is  a  typical  example  of  half-half  decomposition,  while  bubble  sort  is  an  example 
of  first-rest  decomposition. 
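
The two decomposition schemes can be sketched in Python as one divide-and-conquer skeleton with interchangeable split operations. The names `divide_and_conquer`, `first_rest`, `half_half`, and `merge` are hypothetical and are not KIDS operations. Note also that with a merging combine step the first-rest instance behaves like insertion sort, which is used here as a simple stand-in rather than the bubble sort named in the text.

```python
def merge(a, b):
    """Combine two sorted lists into one sorted list."""
    out, i, j = [], 0, 0
    while i < len(a) and j < len(b):
        if a[i] <= b[j]:
            out.append(a[i])
            i += 1
        else:
            out.append(b[j])
            j += 1
    return out + a[i:] + b[j:]


def first_rest(xs):
    """Split off the first element from the rest."""
    return xs[:1], xs[1:]


def half_half(xs):
    """Split the input roughly in half."""
    mid = len(xs) // 2
    return xs[:mid], xs[mid:]


def divide_and_conquer(xs, split, combine):
    """Solve each part recursively, then combine the partial solutions."""
    if len(xs) <= 1:
        return list(xs)
    left, right = split(xs)
    return combine(divide_and_conquer(left, split, combine),
                   divide_and_conquer(right, split, combine))


if __name__ == "__main__":
    data = [5, 2, 9, 1, 5, 6]
    print(divide_and_conquer(data, first_rest, merge))  # insertion-style sort
    print(divide_and_conquer(data, half_half, merge))   # mergesort
```

Sorting works under either split because any two sorted sublists can be merged into a sorted whole. The point of the next paragraph is that n-queens has no such property for these two splits.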

These  forms  of  decomposition  are  not  generally  applicable.  For  example,  neither  of  these 
forms  of  decomposition  applies  to  the  n-queens  problem.  In  first-rest  decomposition,  we 
would  place  one  queen,  then  place  the  remaining  n-1  queens.  But  as  we  saw  above,  if  we 
place  the  first  queen  in  the  wrong  place  (a  corner),  the  1-queen  solution  will  not  compose 
with any solution to the n-1 queens to give a valid n-queen solution. Similarly, half-half
decomposition would find any solution for placing 2 queens, and compose it with another
2-queen solution (which could be a copy of itself). This need not be a solution to the 4-queen
problem.

The next section briefly describes a domain-independent theory for finding domain-dependent
decompositions. The reader is referred to Benjamin (1993) and Benjamin (1992)
for further detail.

Automatic  Decomposition  of  Theories 

The  remainder  of  this  paper  presents  a  method  for  automatically  reformulating  a  domain 
theory.  This  method  finds  the  local  symmetries  and  invariants  of  a  theory  and  uses  them  in 
two  ways: 

•  to  reformulate  the  theory  in  terms  of  the  composition  of  independent  subtheories, 
thereby  generating  a  hierarchy  for  synthesis  by  decomposition; 

•  to inductively construct the class of problems in which this decomposition is guaranteed
to work.

The decomposition method yields a set of components that are associated with independent
features of the problem.

By  analyzing  the  local  symmetries  and  invariants  of  the  theory,  the  system  can  relate 
local  and  global  properties.  In  particular,  it  can  derive  how  local  solutions  can  be  composed 
to  give  valid  global  solutions.  By  computing  how  these  symmetries  can  be  transformed  into 
each  other,  the  system  inductively  defines  the  class  of  tasks  in  which  these  properties  are 
guaranteed  to  hold,  i.e.,  it  determines  how  the  solutions  scale  up. 

We are considering a task theory $M = (Q, A, \delta)$ consisting of a set $A$ of actions defined as
partial functions on a state space $Q$, together with a mapping $\delta: Q \times A \to Q$ that defines the
state transitions. We are not concerned with the syntactic details of the encoding of $A$, but
rather with which actions should be labeled the same and should therefore be considered
instances of a common abstraction. In other words, we are concerned with the algebraic
structure of $A$. A task is specified by a pair $(i, g)$, where $i: 1 \to Q$ maps a one-element set into
$Q$ identifying an initial state, and $g: 1 \to Q$ identifies a desired state. Further details are
given in Benjamin et al. (1990). Without loss of generality, we can restrict our attention to
semigroups of partial 1-1 functions on the state space $Q$ (Howie, 1976).

The description of a system for the synthesis of a plan (or control) for $M$ differs from the
description of $M$ in at least one essential way: the process of planning an action is reversible
whether or not the action itself is reversible (assuming the synthesizing system can backtrack).
Thus, the process of synthesizing plans for a given $M_1 = (Q, A_1, \delta_1)$ can be described
by an automaton whose actions form an appropriate inverse semigroup containing $A_1$,
together with newly added inverses corresponding to backtracking. To define this automaton
properly it is necessary to avoid the situation where a new inverse is applied (an action is
retracted) when that action was not previously applied, e.g., moving a pawn backwards
when the last action was not moving it forwards. To do so, we define the synthesizing
automaton as a pushdown automaton $M_2$ defined as follows:

The state set of $M_2$ is $Q \times B_2^*$.

The action set of $M_2$ is $A_2 = A_1$ augmented by inverses for the non-invertible actions in $A_1$.

The pushdown alphabet is $B_2 = A_1$.

The state transition map $\delta_2$ pushes actions on which it is defined onto the stack. $\delta_2$ is
defined to act like $\delta_1$ on actions in $A_1$, and to invert a non-invertible action in $A_1$ only when
that action is on top of the pushdown store. Otherwise, $\delta_2$ is undefined.
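
The construction of the synthesizing automaton can be sketched in Python under some simplifying assumptions. Actions are represented as dictionaries (partial 1-1 functions on states), a configuration is a pair of a state and a stack of action names, and every action is given an undo symbol, not only the non-invertible ones. The names `delta2`, `invert`, and the `undo:` prefix are hypothetical.

```python
UNDO = "undo:"  # hypothetical prefix marking a newly added inverse


def invert(action):
    """Inverse of a partial 1-1 function given as a dict."""
    return {image: state for state, image in action.items()}


def delta2(actions, config, symbol):
    """Transition map of the synthesizing pushdown automaton (a sketch).

    `config` is (state, stack). Returns the next configuration, or None
    where the transition is undefined.
    """
    state, stack = config
    if symbol in actions:
        # Forward step: act like delta_1 and push the action on the stack.
        if state not in actions[symbol]:
            return None
        return (actions[symbol][state], stack + (symbol,))
    if symbol.startswith(UNDO):
        name = symbol[len(UNDO):]
        # Retract an action only when it is on top of the pushdown store.
        if stack and stack[-1] == name:
            inverse = invert(actions[name])
            if state in inverse:
                return (inverse[state], stack[:-1])
    return None


if __name__ == "__main__":
    # A pawn on ranks 2..8 that can only advance one rank at a time.
    actions = {"advance": {rank: rank + 1 for rank in range(2, 8)}}
    start = (2, ())
    moved = delta2(actions, start, "advance")
    print(moved)
    print(delta2(actions, moved, UNDO + "advance"))
    print(delta2(actions, start, UNDO + "advance"))  # undefined: nothing to retract
```

The stack is what prevents the pawn from being moved backwards unless the last action was the corresponding forward move.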

In  the  remainder  of  this  paper,  we  will  consider  pushdown  automata  for  plan  synthesis. 

To  analyze  the  structure  of  such  a  semigroup  of  transformations,  a  natural  step  is  to 
examine  Green's  relations  (Lallement,  1979).  Green's  equivalence  relations  are  defined  as 
follows:  given  any  semigroup  S: 

$a\,R\,b$ iff $aS^1 = bS^1$; $a\,L\,b$ iff $S^1a = S^1b$; $a\,J\,b$ iff $S^1aS^1 = S^1bS^1$;

$H = R \cap L$; $D = R \vee L$

where $S^1$ denotes the monoid corresponding to $S$ ($S$ with an identity element 1 adjoined) and
$\vee$ denotes the join of two relations. Intuitively, we can think of these relations in the following
way: aRb iff for any plan that begins with “a”, there exists a plan beginning with “b”
that  yields  the  same  behavior;  aLb  iff  for  any  plan  that  ends  with  “a”,  there  exists  a  plan 
ending  with  “b”  that  yields  the  same  behavior;  aHb  indicates  functional  equivalence,  in  the 
sense  that  for  any  plan  containing  an  “a”  there  is  a  plan  containing  “b”  that  yields  the  same 
behavior;  two  elements  in  different  D-classes  are  functionally  dissimilar,  in  that  no  plan 
containing  either  can  exhibit  the  same  behavior  as  any  plan  containing  the  other.  Utilizing 
Green’s  relations,  Lallement  defines  a  coordinate  system  for  a  semigroup: 

Definition. Let $D$ be a D-class of a semigroup $S$, and let $H_{\lambda\rho}$ ($\lambda \in \Lambda$, $\rho \in P$) be the set of
H-classes contained in $D$ (indexed by their L-class and R-class). A coordinate system for $D$
is a selection of a particular H-class $H_0$ contained in $D$, and of elements $q_{\lambda\rho}$, $q'_{\lambda\rho}$, $r_{\lambda\rho}$, $r'_{\lambda\rho} \in S^1$
with $\lambda \in \Lambda$, $\rho \in P$ such that the mappings $x \to q_{\lambda\rho}\, x\, r_{\lambda\rho}$ and $y \to q'_{\lambda\rho}\, y\, r'_{\lambda\rho}$ are
bijections from $H_0$ to $H_{\lambda\rho}$ and from $H_{\lambda\rho}$ to $H_0$, respectively. A coordinate system for $D$ is
denoted by $[H_0; \{(q_{\lambda\rho}, q'_{\lambda\rho}, r_{\lambda\rho}, r'_{\lambda\rho}) : \lambda \in \Lambda, \rho \in P\}]$.

Each coordinate system gives a Rees matrix representation in much the same way that a
coordinate system in a vector space gives a matrix representation, permitting us to change
coordinates within a semigroup by performing a similarity transformation (inner automorphism)
in the usual way (the reader is referred to Lallement for details). Each local coordinate
system within the semigroup expresses a distinct structuring of a subtask, as we can
choose  a  point  in  Q  to  be  the  “origin”,  and  label  points  in  Q  according  to  the  actions  that 
map  the  origin  to  them.  As  we  vary  the  coordinate  system,  each  action  varies  through  an 
equivalence  class  of  actions.  Any  local  property  that  is  invariant  under  all  local  coordinate 
transformations  is  a  property  of  the  subtask  itself,  and  hence  is  an  abstract  property.  Any 
property that is not invariant under coordinate transformation is an implementation property.
In particular, the abstract names for actions, which should not reflect implementation
distinctions, can be determined by factoring a given automaton by its group of local coordinate
transformations. The quotient semigroup thus obtained is the abstract language implemented
in the given automaton.

The  morphism  from  the  D-class  of  the  original  semigroup  to  its  abstract  quotient  is  a 
partial  morphism.  The  inverse  of  this  morphism  specifies  the  implementation  of  each 
abstract  action  in  terms  of  sets  of  sequences  of  actions  of  the  original  semigroup. 

Example:  The  Towers  of  Hanoi 

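Let us number the nine states of the 2-disk Towers of Hanoi as follows:

[Figure: the nine states of the 2-disk Towers of Hanoi, numbered 1 to 9, with arrows for the two actions.]

Let the two possible actions be denoted by “x” and “y”.

x = ( 1 2 3 4 5 6 7 8 9 )
    ( 2 3 1 5 6 4 8 9 7 )

y is a partial function defined on only three of the nine states; its images, in order of increasing argument, are 4, 8, and 3.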

“x” moves the small disk right one peg (wrapping around from peg 3 to peg 1), and “y”
moves the large disk one peg to the left (wrapping around from peg 1 to peg 3). In the figure,
“x” is shown by narrow, counterclockwise arrows, and “y” is shown by thick, counterclockwise
arrows. These two actions generate a semigroup of 31 distinct partial functions on the
states. Green’s relations for this semigroup are:


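D0:  the empty mapping

D1:  x, xx, xxx

D2 (each row is an R class, each column is an L class, each cell is an H class):

  xyx, xyxxyx, xyxxyxxyx     |  xy, xyxxy, xyxxyxxy     |  xyxx, xyxxyxx, xyxxyxxyxx
  xxyx, xxyxxyx, xxyxxyxxyx  |  xxy, xxyxxy, xxyxxyxxy  |  xxyxx, xxyxxyxx, xxyxxyxxyxx
  yx, yxxyx, yxxyxxyx        |  y, yxxy, yxxyxxy        |  yxx, yxxyxx, yxxyxxyxx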

There  are  three  D  classes,  shown  as  the  three  separate  large  boxes.  In  each  D  class,  the  R 
classes  are  rows  and  the  L  classes  are  columns,  and  they  intersect  in  the  small  boxes,  which 
are H classes. Note that D0 and D1 consist of only one R class and one L class, and hence
one  H  class.  The  idempotents  are  in  bold  type. 
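
The counts stated above can be checked with a short Python sketch that generates the semigroup and computes Green's relations by brute force. The state encoding is an assumption: a state is a pair (small-disk peg, large-disk peg) with pegs numbered 0 to 2, which need not match the numbering 1 to 9 used in the text. The D classes are computed as J classes, which coincide with them in any finite semigroup.

```python
PEGS = 3
# Assumed encoding: a state is (small_peg, large_peg); the 9 states get indices 0..8.
STATES = [(s, l) for l in range(PEGS) for s in range(PEGS)]
INDEX = {st: i for i, st in enumerate(STATES)}


def make_action(step):
    """Encode a partial function as a tuple: entry i is the image of state i, or None."""
    images = []
    for st in STATES:
        nxt = step(st)
        images.append(None if nxt is None else INDEX[nxt])
    return tuple(images)


def small_right(st):
    s, l = st
    return ((s + 1) % PEGS, l)


def large_left(st):
    s, l = st
    target = (l - 1) % PEGS
    # Legal only if the small disk is neither on the large disk nor on the target peg.
    if s == l or s == target:
        return None
    return (s, target)


x = make_action(small_right)
y = make_action(large_left)


def compose(a, b):
    """Apply a first, then b (words are read left to right, as in 'xy')."""
    return tuple(None if img is None else b[img] for img in a)


def generate(gens):
    """Close the generators under composition."""
    elems, frontier = set(gens), list(gens)
    while frontier:
        a = frontier.pop()
        for g in gens:
            c = compose(a, g)
            if c not in elems:
                elems.add(c)
                frontier.append(c)
    return elems


S = generate([x, y])


def right_ideal(a):   # a S^1
    return frozenset({a} | {compose(a, s) for s in S})


def left_ideal(a):    # S^1 a
    return frozenset({a} | {compose(s, a) for s in S})


def two_sided_ideal(a):  # S^1 a S^1
    both = {compose(compose(s, a), t) for s in S for t in S}
    return frozenset(right_ideal(a) | left_ideal(a) | both)


def classes(key):
    """Partition S by the value of `key`."""
    groups = {}
    for a in S:
        groups.setdefault(key(a), set()).add(a)
    return list(groups.values())


R = classes(right_ideal)
L = classes(left_ideal)
H = classes(lambda a: (right_ideal(a), left_ideal(a)))
D = classes(two_sided_ideal)

if __name__ == "__main__":
    print(len(S), sorted(len(c) for c in D), len(R), len(L), len(H))
```

Under this encoding the sketch reports 31 elements and three D classes of sizes 1, 3, and 27, the largest of which splits into three R classes, three L classes, and nine H classes of three elements each.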

There are no nontrivial inner automorphisms of D0 and D1. The group of inner automorphisms
of D2 is a cyclic group of order three. These coordinate transformations are global
within D2, but are local in the semigroup. These inner automorphisms are calculated by the
matrix techniques explained by Lallement (1979). A generator for this group is the automorphism
that maps xyx to xxy, xxy to yxx, and yxx to xyx. Factoring D2 by this map gives:

Define z = case { little disk left of large disk: xxy
                  little disk on large disk: xyx
                  little disk right of large disk: yxx }

where  z  is  a  new  symbol. 

x is as before, and z moves both disks left one peg. z is a macro-action, which is implemented
as a disjunction of sequences of actions. x and z are independent controls; x solves
the position of the small disk, and z solves the big disk. x does not change the position of the
big disk, and z does not change the relative positions of the disks. The disks can be solved in
either order, as the new representation is an abelian group. This new representation captures
the important property of the Towers of Hanoi task: the disks can be solved in any
order. In the original theory, this property was obscured by details of the implementations
of the disk moves.
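
The properties claimed for the macro-action z can be checked with a small Python sketch. It assumes pegs numbered 0 to 2 arranged cyclically, so that "left of" and "right of" are read modulo 3, and a state encoded as (small-disk peg, large-disk peg). These encodings and the helper names are illustrative assumptions.

```python
PEGS = 3
STATES = [(s, l) for l in range(PEGS) for s in range(PEGS)]


def x(st):
    """Move the small disk one peg to the right (cyclically)."""
    s, l = st
    return ((s + 1) % PEGS, l)


def y(st):
    """Move the large disk one peg to the left, if that move is legal."""
    s, l = st
    target = (l - 1) % PEGS
    if s == l or s == target:
        return None
    return (s, target)


def run(word, st):
    """Apply a word over {x, y} left to right; None if some step is illegal."""
    moves = {"x": x, "y": y}
    for name in word:
        st = moves[name](st)
        if st is None:
            return None
    return st


def z(st):
    """Macro-action: pick the word according to the relative disk position."""
    s, l = st
    relative = (s - l) % PEGS
    word = {2: "xxy",   # little disk left of large disk
            0: "xyx",   # little disk on large disk
            1: "yxx"}[relative]
    return run(word, st)


if __name__ == "__main__":
    for st in STATES:
        print(st, "->", z(st), " commutes with x:", x(z(st)) == z(x(st)))
```

In this encoding z is defined on every state, shifts both disks one peg to the left, and commutes with x, which is consistent with the abelian structure described above.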

This  decomposition  applies  not  only  to  the  two-disk  problem,  but  to  all  Towers  of  Hanoi 
problems.  This  is  seen  by  forming  free  products  of  the  semigroup  with  itself,  amalgamating 
the  coordinate  axes  in  all  possible  ways.  There  are  three  free  products  of  this  semigroup 
with  itself  that  amalgamate  coordinates:  D2  can  be  identified  with  itself,  D1  with  itself,  and 
D2  with  D1  (by  mapping  xxy,  xyx,  and  yxx  to  x).  These  correspond  to  identifying  the  moves 
of  the  larger  disk  of  one  copy  of  the  semigroup  with  the  moves  of  the  larger  disk  in  another 
copy, identifying the moves of the smaller disk with the moves of the smaller disk, and identifying
the moves of the larger disk with the moves of the smaller disk, respectively. The
result  of  this  construction  is  the  three-disk  Towers  of  Hanoi,  which  is  viewed  as  three  copies 
of  the  two-disk  task  running  concurrently. 


From  the  logical  perspective,  this  can  be  viewed  as  composing  three  copies  of  a  theory  for 
the  two-disk  Towers  of  Hanoi,  to  yield  a  valid  theory  for  the  three-disk  problem.  Copies  of 
the theory are joined by unifying variables between theories. Two copies joined at the middle
disk will not suffice, as then the largest disk could be placed on the smallest disk. A
third  copy  of  the  theory  prohibits  this.  The  purpose  of  computing  a  coordinate  system  is  that 
the  coordinate  axes  determine  what  to  unify. 

The interaction of these three smaller tasks yields a set of relations. For example, consider
moving the middle disk one peg to the right. When considering this disk as the larger
disk in the task consisting of the upper two disks, this move is xyxxy (in the state when all
disks are on the left peg), yxxyx (when the smallest disk is to the right of the middle disk),
or xxyxxyxx. On the other hand, when considering this disk as the smaller disk in the task
consisting of the lower two disks, this move is x (in the other copy of the semigroup). This
means that we add the relation xyxxy = yxxyx = xxyxxyxx to the presentation of the semigroup.

This method of composing larger tasks from smaller ones guarantees that this decomposition
generalizes to any Towers of Hanoi problem with n disks, by considering a coordinate
system for the three-disk task, and forming free products with amalgamation in the same
manner as before. These free products with amalgamation will always reduce to free products
with amalgamation of coordinate systems of the two-disk task, so that by induction the
properties of the two-disk task determine the properties of all Towers of Hanoi tasks.

Example:  Robot  Path  Planning 

Let  us  consider  a  simple  robot  that  moves  on  a  hexagonal  grid.  Each  grid  point  is  labeled 
by  its  row  and  column.  The  robot's  orientation  can  be  north  (N),  northeast  (NE),  southeast 
(SE),  south  (S),  southwest  (SW),  or  northwest  (NW).  It  has  six  basic  motions:  forward  (F), 
forward right (R), forward left (L), backwards (b), backwards right (r), and backwards left
(l). These motions are shown in Figure 1.

[Figure 1. The six basic motions of the robot.]

[Figure 2. The 2x2 grid.]


The  states  of  this  robot  split  into  two  disjoint  sets:  any  position  with  orientation  N,  SE, 
or  SW,  and  any  position  with  orientation  S,  NE,  or  NW.  No  state  in  either  set  is  reachable 
from the other. We will examine only the first set, as the other is isomorphic. Let us examine
the motion of this robot on a 2x2 grid, shown in Figure 2. There are twelve states in Q.
The transformation semigroup A consists of all distinct state mappings generated by {F, R,
L, b, r, l}, together with 0 (the empty mapping). Green’s relations in this semigroup are
shown below.

[Figure: Green’s relations for the robot semigroup.]

Green’s relations are used to construct a Rees matrix representation for the semigroup,
and for the global and local inner automorphisms (the coordinate transformations). These
details are omitted here, due primarily to space considerations. Factoring this semigroup by
its local coordinate transformations gives the familiar decomposition of path planning into
planning changes in position and then planning changes in orientation. (More precisely, the
semigroup is the normal extension of the subsemigroup that holds orientation invariant by
a group of orientation changes that holds position invariant.)

Now  let  us  see  how  this  decomposition  scales  up  to  all  path  planning  on  any  such  grid. 

There are twelve free products of this semigroup with itself that amalgamate local coordinates.

[Figure 3 (panels a and b). The twelve ways of composing 2x2 grids. This “flower” shape
contains twelve 2x2 grids, each of which intersects every other in at least one point, the
center point.]

For example, the state (2,1,N) of the bold grid in Figure 3a corresponds to a different
state in each of the other eleven 2x2 grids in the figure. In particular, it corresponds to the
state (2,1,SE) in the bold 2x2 grid in Figure 3b. In this way, each state in the 2x2 grid can
potentially correspond to any of the other eleven states in another 2x2 grid. As a result,
the motion RRFrLbll, which maps (2,1,N) to (2,1,SW), must be given the same relabeling as
LbllbLrF, which maps (2,1,SE) to (2,1,N). Continuing in this way, we derive the following:
FRRF = rLbl = lbLr = RFrL = bllb = LrFR = RRFrLbll = bLrFRRFr = LbllbLrF = FrLbllbL =
rFRRFrLb = llbLrFRR. These twelve strings are the motions that rotate the robot counterclockwise
one turn, while holding the grid position invariant. Similarly, the clockwise
motions, and the motions that hold the orientation invariant while changing position, map
together.

This  can  be  viewed  as  composing  twelve  copies  of  a  theory  for  the  2x2  grid  to  yield  a  valid 
theory for moving on a larger grid. Copies of the theory are joined by unifying variables
between  theories.  The  purpose  of  computing  a  coordinate  system  is  that  the  coordinate  axes 
identify  what  to  unify.  When  the  small  grid  is  a  subtask  of  a  larger  grid,  the  actions  on  the 
small  grid  must  be  able  to  be  formulated  as  actions  on  the  large  grid.  So,  when  considering 
all  ways  of  composing  grids  to  make  larger  grids,  we  are  considering  all  ways  of  formulating 
small  grid  actions  as  larger  grid  actions,  i.e.,  all  ways  of  transforming  coordinates  within 
the small grid.

Thus, this method of composing larger tasks from smaller ones guarantees that the
abstract properties of the small grid scale up to any grid composed from small grids, and in
particular that the decomposition of the small grid generalizes to this infinite class of grids.
This justifies the use of this path planning algorithm on any such grid. If any of the component
semigroups resulting from this reformulation are too large, then the method is applied
recursively, yielding a hierarchical decomposition. The method is terminated when the
semigroups at the leaf nodes are considered sufficiently small to be searched, or are recognized
as previously solved subtasks.

Example:  N-Queens 

We  now  return  to  the  n-Queens  problem,  to  show  how  the  reformulation  method  leads  to 
a program for enumerating the solutions to the problem without considering any non-solutions.
In this section, we will not show the computation of Green’s relations, as this is repetitive.
Instead, we will show the local and global symmetries pictorially.

The next figure shows a solution to the 4-queens problem. (The other solution is obtained
by a flip symmetry of this one.) This solution has a hierarchy of local symmetries. The
top-level local symmetry is that of the 3x4 subproblems, but the four 3x4 subproblems in the
4x4 are mapped to each other only by the cyclic global symmetry of order 4. The first new
local symmetries arise from considering the 2x3 subproblem that contains 2 queens. The
four instances of this subproblem are of course also mapped to each other by the cyclic global
symmetry, but also map to themselves by a local symmetry of order 2. The next figure
shows these four instances, together with the 1x2 subproblems that arise from intersecting
the 2x3 subproblems:


As in the other problems considered, we construct larger instances of the n-queens problem
from copies of the 4x4 solution, amalgamating the local symmetries. For example, the
6-queens solution can be constructed from four copies of the 4-queens solution by amalgamating
2x3 subproblems:


The  6-queens  solution  can  be  viewed  as  three  4-queens  solutions  in  this  way. 

Stated another way, the construction of the 6-queens solution from multiple 4-queens
solutions can be thought of as follows: take 2 copies of the 4-queens solution and amalgamate
a 2x3 subproblem in one copy with a 2x3 in the other. There are 6 distinct queens. The
four queens in the first copy are guaranteed to be on their own rows, columns, and diagonals,
and the four queens in the second copy are similarly guaranteed to satisfy the solution
conditions. But we are not guaranteed that the two non-amalgamated queens in one copy
are on distinct rows, columns, or diagonals from the two non-amalgamated queens in the
other copy. It is necessary that those four queens also satisfy the solution conditions, i.e.,
we need them to be a 4-queens solution by themselves. Therefore, composing three 4-queens
solutions in this way is both necessary and sufficient to give a 6-queens solution, which
implies that all 6-queens solutions are constructed in this way.
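
The global symmetries of the 4x4 base case described in this section can be checked with a short Python sketch. It does not implement the amalgamation construction itself; it only enumerates solutions by brute force and tests the flip symmetry and the cyclic symmetry of order 4. The function names and the encoding (a solution is a tuple giving the queen's column for each row) are assumptions made for illustration.

```python
def queens(n, cols=()):
    """Return all n-queens solutions extending the partial placement `cols`."""
    if len(cols) == n:
        return [cols]
    r = len(cols)
    found = []
    for c in range(n):
        # Keep c only if it shares no column or diagonal with an earlier queen.
        if all(c != c0 and abs(c - c0) != r - r0 for r0, c0 in enumerate(cols)):
            found += queens(n, cols + (c,))
    return found


def mirror(sol):
    """Flip the board left to right."""
    n = len(sol)
    return tuple(n - 1 - c for c in sol)


def quarter_turn(sol):
    """Rotate the board by 90 degrees: square (r, c) goes to (c, n - 1 - r)."""
    n = len(sol)
    rotated = [None] * n
    for r, c in enumerate(sol):
        rotated[c] = n - 1 - r
    return tuple(rotated)


if __name__ == "__main__":
    first, second = queens(4)
    print(first, second)
    print(mirror(first) == second)        # the two solutions are flips of each other
    print(quarter_turn(first) == first)   # each is invariant under a quarter turn
    print(len(queens(6)))                 # number of 6-queens solutions
```

The quarter-turn invariance corresponds to the cyclic global symmetry of order 4 mentioned above, and the mirror relation to the flip symmetry between the two 4-queens solutions.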

Thus,  we  can  construct  n-queens  solutions  from  smaller  solutions.  It  is  not  necessary  to 
search  a  space  containing  non-solutions.  Instead,  analysis  of  the  structure  of  the  base  case 
reveals  both  the  decompositions  of  the  base  case  and  the  inductive  definition  of  the  class  of 
problems with the same structure. Ideally, a system such as KIDS should have the capability
to perform this analysis, to synthesize programs that construct the solution space
directly in this fashion whenever possible.

Summary

The analysis of the structure of a representation can lead to both a greater understanding
of the task and to techniques for system design and implementation. Identification of
the invariants and symmetries of a representation yields decompositions of the representation.
The definition of coordinate systems for transformation semigroups gives a method for
automatic  design  of  a  representation  for  a  class  of  tasks. 

KIDS supports the divide-and-conquer approach to problem solving, but needs a technique
for automatic identification of useful decompositions. This work has completed a theory
of decomposition, and demonstrated how it can be used to reformulate KIDS domain
theories to find effective algorithms.

Abstract

High-frequency  radio  waves  reflected  from  the  Earth's  ionosphere  can  be  used  to  illuminate 
targets  that  are  several  thousand  kilometers  from  the  transmitter.  Much  of  the  early  research  and 
development  of  this  over-the-horizon  (OTH)  technique  was  conducted  during  the  1950s  at  the  Air 
Force  Rome  Laboratory  (then  called  the  Rome  Air  Development  Center)  and  at  the  Naval  Research 
Laboratory.  Today,  OTH  radar  technologies  play  a  central  role  in  the  DoD  Counterdrug  Program, 
where  the  primary  mission  is  the  detection  of  small  targets  with  relatively  low  radial  velocity.  The 
ionosphere,  however,  is  a  highly  variable  medium  containing  a  wide  range  of  irregularities,  whose 
structure and behavior are dependent on many factors. In many situations, these irregularities can
have  a  deleterious  effect  on  the  signals  received  by  HF  OTH  radar  systems.  Although  ionospheric 
plasma  irregularities  are  present  to  some  degree  at  all  latitudes,  they  are  most  prevalent  at  night  and 
in  the  equatorial  and  auroral  regions.  Their  effect  is  to  broaden  the  Doppler  spectrum  of  the 
backscattered clutter, which in turn masks the signature of low-velocity targets with small radar
cross  section.  Several  strategies  have  been  proposed  to  mitigate  the  effect  of  these  irregularities  but 
no  satisfactory  solution  has  yet  been  adopted  for  operational  systems.  One  possible  solution  to  this 
problem  is  the  application  of  phase  encoding  of  the  transmitted  signal  and  the  appropriate  decoding 
of  the  received  backscatter  echoes.  The  implementation  of  this  solution  was  carried  out  by 
Decision-Science  Applications,  Inc.  under  contract  to  Rome  Laboratory  and  the  first  experimental 
field  tests  were  carried  out  in  June  and  July  of  1994  using  in-house  facilities.  This  report 
discusses some of the initial results obtained during these field tests.

The Application of Quadratic Phase Coding to OTH Radar Signals


FRANK  T.  BERKEY 

Introduction 

For  several  decades,  high-frequency  (HF)  radio  waves  have  been  used  to  study  the  ionosphere  using  vertical 
incidence  ionospheric  radars  known  as  ionosondes  [e.g.  Davies,  1991].  These  instruments  transmit  vertically  into 
the  ionosphere  and  sweep  through  the  HF  spectrum  at  frequencies  from  «1  to  20  MHz,  recording  the  time-of-flight  of 
echoes  reflected  from  the  various  ionospheric  layers  as  a  function  of  frequency.  The  time-of-flight  can  be  converted 
to  the  "virtual"  height  of  the  ionosphere  and  the  frequency  at  which  wave  retardation  occurs  directly  interpreted  as  the 
critical  plasma  density  of  the  particular  ionospheric  layer  through  the  relation  —  f  /  81,  where  is  the  electron 

density  per  cubic  meter  and  /  is  the  probing  frequency  in  Hertz.  The  measurements  from  these  instruments, 
distributed  in  a  loose-knit  network  of  global  stations,  have  been  (and  currently  are)  used  to  infer  a  wide  variety  of 
information  about  the  morphology  of  the  ionosphere.  Since  the  ionospheric  plasma  density  is  a  function  of  many 
variables, such as latitude, time-of-day, season and solar cycle, it is not easily characterized by mathematical functions
except on a very coarse time scale. Thus, synoptic measurements from the global ionosonde network have
been used to determine the ionospheric "weather" on both micro- and macro-scales.
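
As a worked illustration of the two conversions described above, the following sketch turns an echo time-of-flight into a virtual height and a critical frequency into an electron density using N = f^2/81. The function names and sample values are illustrative only and are not taken from the ionosonde literature cited here; the height is "virtual" precisely because propagation at the free-space speed of light is assumed.

```python
C_KM_PER_S = 2.998e5  # speed of light in km/s


def virtual_height_km(time_of_flight_s):
    """Virtual height: half the round-trip path, assuming propagation at c."""
    return 0.5 * C_KM_PER_S * time_of_flight_s


def electron_density_per_m3(critical_freq_hz):
    """Electron density from N = f^2 / 81 (f in Hz, N in electrons per m^3)."""
    return critical_freq_hz ** 2 / 81.0


if __name__ == "__main__":
    # A 2 ms round trip and a 9 MHz critical frequency (illustrative values).
    print(virtual_height_km(2.0e-3))
    print(electron_density_per_m3(9.0e6))
```

A 9 MHz critical frequency therefore corresponds to a density of 10^12 electrons per cubic meter under this relation.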

To  obtain  information  about  the  ionosphere  at  locations  other  than  overhead,  the  technique  of  oblique 
sounding  was  developed,  which  provided  data  at  the  ionospheric  mid-point  between  transmitter  and  receiver  [see  e.g. 
Croft,  1972].  Subsequently,  it  was  recognized  within  the  defense  community  that  targets  beyond  the  horizon  could 
be  illuminated  using  high  power  HF  radio  waves  that  have  been  reflected  one  or  more  times  from  the  ionosphere. 
This  technique,  known  as  over-the-horizon  (OTH)  radar,  was  developed  in  the  United  States  at  the  Air  Force  Rome 
Laboratory (RL) and the Naval Research Laboratory starting in the 1950s [Stebbins, 1992]. The research and development
carried out under the auspices of these laboratories led to the deployment of operational OTH systems
[Headrick,  1974,  1990a,b].  The  same  technique  was  also  employed  by  the  former  Soviet  Union  [Kolosov,  1987; 
Estravkov,  1994]  and  more  recently  in  Australia  [Anderson,  1986, 1992;  Earl  and  Ward,  1987].  OTH  radar  systems 
were initially utilized for the detection and early warning of long-range bombers and/or cruise missiles launched in
hostile  attacks  against  the  United  States.  To  that  end,  the  first  two  operational  OTH  systems,  deployed  on  the  east 
and  west  coasts,  were  intended  to  monitor  the  ionosphere  above  the  former  Soviet  Union.  After  the  end  of  the  Cold 
War  in  1989,  the  Soviet  threat  disappeared  as  did  the  primary  mission  of  the  OTH-B  radars.  In  place  of  more 
conventional  airborne  radar  systems,  OTH  radar  technology  has  recently  been  utilized  in  the  DoD/ARPA 
Counterdrug  program  to  detect  the  small,  low  speed,  maneuvering  aircraft  that  are  typically  used  to  transport 
contraband  drugs  from  South  and  Central  America  into  the  U.S. 

Two  OTH  radar  systems  are  currently  operational  within  the  U.S.  The  Air  Force  OTH  radar,  designated 
AN/FPS-118,  is  sited  near  Bangor,  Maine  and  has  been  in  operation  since  1985  [Boutacoff,  1985].  The  field-of-view 
(FOV)  of  this  OTH  radar  encompasses  look  angles  over  the  South  American  continent  although  this  was  not  the 
intended function of the system. In 1992, the U.S. Navy deployed an OTH radar (designated AN/TPS-71) at a
location  near  Chesapeake,  Virginia  (this  system  is  also  referred  to  as  ROTHR).  The  primary  FOV  of  this  radar  is 
directed  over  South  America;  ten  angular  cones  of  HPBW  are  illuminated  by  the  transmitter,  each  of  which  is 
sampled  using  16  narrow  receive  beams.  Two  additional  AN/TPS-71  radars  are  to  be  installed,  one  in  Texas  and  the 
other  at  a  site  yet  to  be  determined.  The  deployment  of  an  advanced  OTH  (AOTH)  radar  in  an  offshore  theater  is 
currently  under  consideration. 

The  range  window  sampled  by  an  OTH  radar  is  governed  to  a  large  extent  by  the  propagation  conditions 
existing at the time of transmission, but generally covers between 900 and 3700 km via a single ionospheric reflection.
Targets at relatively short ranges are illuminated by HF radio wave propagation via the ionospheric E-layer (at
~120 km), whereas F-layer (~250 km) propagation enables target detection at greater distances. The radar cross section
(RCS) of the target is also important since it is a function of the frequency transmitted by the radar. At night,
for  example,  the  ionosphere  supports  HF  propagation  at  lower  frequencies  than  during  the  day  when  ionospheric 
densities  are  higher.  Hence,  the  same  target  will  have  a  smaller  RCS  when  illuminated  at  night  because  of  the 
longer  wavelengths  that  are  needed  to  sustain  long  distance  propagation.  A  further  advantage  of  HF  OTH  technology 
is  its  effectiveness  against  stealth  aircraft. 

The  OTH  radar  systems  deployed  in  the  U.S.  transmit  a  frequency  modulated,  continuous  wave  (FMCW) 
wave  form  at  power  outputs  ranging  from  200-1200  kW.  The  transmitter  and  receiver  sites  are  spatially  separated, 
typically by distances on the order of 200 km, since a continuous wave form is transmitted. Note that co-sited
OTH systems have recently been advocated for the proposed French NOSTRADAMUS OTH system [Casamayou,
1991;  Saillant,  1992].  Large  aperture  arrays  with  narrow,  steerable  beams  have  been  developed  for  receiving  [e.g. 
Washburn  and  Sweeney,  1976].  For  example,  the  wide  aperture  research  facility  (WARF)  operated  by  Stanford 
Research  Institute  (SRI),  employs  a  linear  receiving  antenna  array  2.5-km  in  length  containing  256  vertical 
monopole  elements,  each  5.5m  tall.  MITRE  Corporation  has  recently  constructed  a  3-km  diameter  array  of  96  short 
(6m) vertical monopole elements near Odessa, Texas [e.g. Rastogi et al, 1994]. Each of the monopole elements in
this  sparse  array  is  connected  to  a  digital  receiver  and  numerical  beamforming  is  carried  out. 

In most OTH applications the Doppler frequency spectrum of the backscattered signal is the fundamental
data product. Ionospherically reflected backscatter is received from both land and sea surfaces and is typically 30-70
dB stronger than the echo from an aircraft target [Barnum, 1993]. Land reflections have zero Doppler shift and are often characterized
by echoes from mountain ranges, large urban areas, and long, elevated power lines. Backscatter from the ocean
has  a  characteristic  Doppler  spectrum  due  to  surface  waves  which  generate  the  so-called  Bragg  lines  at  ±0.38  Hz 
[Crombie,  1955;  Wyatt,  1990].  Such  spectra  have  been  utilized  to  derive  surface  wind  maps  over  a  wide  area  of  the 
ocean  [Georges  and  Harlan,  1992;  Atlas  and  Weinman,  1992]  but  generally  require  long  radar  dwell  times.  The 
difference  in  reflectance  at  the  land-sea  boundary  also  produces  a  unique  backscatter  signature.  A  moving  target  with 
sufficient  radial  velocity  will  produce  a  spectral  signature  in  the  Doppler  regime  that  is  distinguishable  from  other 
sources  of  clutter. 
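
The Doppler quantities in this paragraph can be illustrated with a short sketch. It assumes the standard two-way Doppler relation for a target and the first-order Bragg relation for deep-water waves at grazing incidence; neither formula is written out in this report, and the function names and sample values are hypothetical.

```python
import math

C = 2.998e8  # speed of light, m/s
G = 9.81     # gravitational acceleration, m/s^2


def target_doppler_hz(radial_velocity_ms, radar_freq_hz):
    """Two-way Doppler shift for a target with the given radial velocity."""
    return 2.0 * radial_velocity_ms * radar_freq_hz / C


def bragg_line_hz(radar_freq_hz):
    """First-order Bragg offset, assuming deep-water waves of length lambda/2."""
    wavelength_m = C / radar_freq_hz
    return math.sqrt(G / (math.pi * wavelength_m))


if __name__ == "__main__":
    print(target_doppler_hz(100.0, 16.5e6))  # aircraft closing at 100 m/s
    print(bragg_line_hz(14.0e6))             # sea-clutter line offset
```

Under these assumptions a Bragg offset of about 0.38 Hz corresponds to a radar frequency near 14 MHz, while a target closing at 100 m/s at 16.5 MHz is shifted by roughly 11 Hz, well clear of both the land and sea clutter lines.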

To a first approximation, the ionosphere can be considered as a perfectly reflecting mirror, in which case the
backscattered signals undergo little Doppler "smearing" and have relatively narrow peaks in the Doppler frequency
regime. In practice, however, the ionosphere is frequently both in motion and highly structured, which has the effect
of  broadening  the  backscatter  signal  in  the  Doppler  regime.  The  occurrence  of  backscatter  echoes  with  a  broad 
Doppler  spectrum  is  termed  spread  Doppler  clutter  (SDC)  by  the  OTH  radar  community  and  it  has  a  deleterious  effect 
on  operational  systems.  The  ionospheric  irregularities  (generally  referred  to  as  "spread-F")  responsible  for  this  clutter 
are  most  prevalent  in  the  auroral  and  equatorial  regions  but  also  occur  at  mid-latitude,  particularly  at  night. 

The  backscattered  OTH  echoes  undergo  several  forms  of  signal  contamination  and  distortion  that  can  be 
identified  with  known  physical  processes  occurring  in  the  ionosphere  [Anderson,  1992].  These  processes  result  in 
errors  of  up  to  ±20  km  in  the  location  of  aircraft  targets.  The  primary  sources  are: 

•  Slow  phase  path  variations  caused  by  traveling  ionospheric  disturbances  and  other  large-scale  phenomena; 

• Discrete multi-mode propagation via E, F1, F2 and hybrid paths to and from the target area;

•  Spectral  broadening  due  to  small-scale  plasma  turbulence; 

•  High  angle  rays,  with  strongly  range-dependent  Doppler  shifts; 

•  Impulsive  noise  from  thunderstorms  in  the  lower  atmosphere; 

• Man-made radio frequency interference;

•  Echoes  from  meteors,  auroras,  field-aligned  plasma  striations  and  equatorial  plasma  instabilities; 

•  Ground  clutter  received  through  sidelobes,  back  lobes  and  two-hop  contributions. 

Ionospheric  spread-F  in  the  equatorial  region  is  a  major  source  of  SDC  for  OTH  systems  in  both  the  U.S. 
and Australia. The equatorial region is broadly defined as that region within ±20° of the geomagnetic equator where
anomalously large electron densities are observed (i.e. the Appleton anomaly). On the 75°W meridian, the magnetic
equator occurs at ≈10°S latitude, or ≈6000 km down range from the RL transmitter at Ava, NY. The Doppler spectrum
of backscattered echoes passing through these irregularities (or reflected by them) can be broadened to such an
extent (≈ ±10 Hz) that targets with low Doppler velocity are undetectable. Although equatorial spread-F has been
studied  rather  extensively,  no  satisfactory  theory  can  address  such  questions  as  to  why  it  occurs  on  some  nights  and 
not  on  others  or  why  it  is  more  prevalent  during  particular  seasons  of  the  year  at  certain  locations  [Kelley,  1989]. 
Structures  with  spatial  scale  sizes  between  0.1  and  10^  m  have  been  detected  in  the  equatorial  ionosphere  and  various 
hypotheses  have  been  put  forth  to  explain  the  generation  of  the  observed  irregularities  [Kelley,  1992].  These  are: 

• Gravity wave seeding and electrodynamic uplift (λ > 200 km),

• Shear effects (200 km > λ > 20 km),

• The generalized Rayleigh-Taylor instability (0.1 km < λ < 20 km),

• Low-frequency drift waves (1 m < λ < 100 m),

• Lower hybrid drift waves (λ < 1 m).

In  an  attempt  to  predict  the  effect  of  ionospheric  irregularities  on  HF  propagation  [see  e.g.  Elkins,  1990], 
several  approaches  to  developing  a  model  of  SDC  have  been  undertaken,  several  of  which  utilized  phase  screens 
[Busier et al, 1988]. Another approach extended a statistical model of clutter in the auroral ionosphere to the equatorial
regions [Elkins, 1987; Elkins and Providakes, 1992]. Recently, Coleman [1993] has combined the statistics of
irregularities with ray tracing and an ionospheric model to predict the characteristics of SDC. This model shows
that the upward motion of the ionosphere at sunset leads to a negative Doppler which accumulates to produce a negative
tilt to the clutter. When the equatorial F2 peak is very high, the model suggests that rays spending a considerable
time in regions of strong irregularities are those returning from the far side of the Appleton anomaly. In the
early  morning,  as  the  F2  peak  falls,  an  overall  positive  tilt  in  the  clutter  develops. 

Clutter  Mitigation  Techniques 

Because  SDC  can  mask  targets  of  interest,  there  is  considerable  motivation  to  find  solutions  which  will 
mitigate these effects. However, the problem becomes complex because of the ambiguities associated with selecting
the radar parameters. The wave form repetition frequency (WRF) determines the range over which backscatter echoes
can occur without range aliasing. For example, at a WRF of 10 Hz, the unambiguous range extent is 15,000 km
but the unambiguous velocity extent is only 200 meters/sec. Increasing the WRF to 50 Hz increases the unambiguous
velocity extent (to 1450 meters/sec), but decreases the unambiguous range extent to only 3000 km. At the higher
WRFs, backscatter echoes from more distant ranges will be aliased into the range window; an example of such aliasing
is discussed below.
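
The range side of this trade-off follows from the round-trip time available in one wave form repetition interval. The sketch below is a minimal illustration under that assumption; the velocity function additionally assumes that the unambiguous Doppler span equals the WRF, and it takes the radar frequency as a free parameter because the carrier frequency behind the velocity figures quoted above is not stated.

```python
C_KM_PER_S = 3.0e5  # speed of light rounded to 3 x 10^5 km/s


def unambiguous_range_km(wrf_hz):
    """Largest range whose echo returns before the next repetition starts."""
    return C_KM_PER_S / (2.0 * wrf_hz)


def unambiguous_velocity_ms(wrf_hz, radar_freq_hz):
    """Velocity span matching a Doppler span equal to the WRF (assumed)."""
    wavelength_m = C_KM_PER_S * 1.0e3 / radar_freq_hz
    return 0.5 * wavelength_m * wrf_hz


if __name__ == "__main__":
    for wrf in (10.0, 48.828, 50.0):
        print(wrf, unambiguous_range_km(wrf))
```

With the rounded value of c this gives 15,000 km at 10 Hz and 3000 km at 50 Hz, and close to 3072 km at 48.828 Hz. For a fixed radar frequency the velocity span scales linearly with the WRF.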

Several  clutter  mitigation  strategies  have  been  proposed;  one  recent  suggestion  has  been  the  deployment  of 
"stereo"  OTH  radars  that  view  a  common  ionospheric  volume  from  different  azimuths  [Barnum,  1991].  As  there  is 
some  evidence  that  equatorial  clutter  has  a  different  temporal  evolution  at  different  azimuths,  the  fusion  of  the  two 
OTH  data  sets  could  be  used  to  mitigate  against  strong  SDC  observed  by  one  or  the  other  of  the  radars.  Another 
proposal has been to take advantage of the parallax that exists as a result of the separation between transmitter and
receiver  [Sievers,  1991]  in  system  siting.  This  solution  also  takes  into  consideration  the  azimuthal  dependence  of 
the  growth  of  irregularities  in  the  equatorial  ionosphere. 

A  third  approach  to  the  problem,  proposed  by  Decision-Science  Applications  (DSA),  Inc.,  is  the  utilization 
of non-recurrent wave form (NRWF) coding of the transmitted signal. This technique has the desired effect of suppressing
the range ambiguous clutter by means of modifying the ambiguity diagram. Various NRWF codes were
evaluated (adaptive, random binary, Frank and quadratic) with regard to their suppression of range ambiguities and the
degree of subclutter visibility. The quadratic NRWF was selected because it modifies the classical "bed of nails"
ambiguity surface such as to fold the range ambiguous clutter to "unused" negative (positive) Doppler frequencies.
The capability of moving the SDC to either positive or negative frequencies can enable OTH operators to selectively
search for targets which are moving toward (away from) the radar. In order to implement the quadratic phase code
(QPC), it was necessary to acquire a frequency agile signal simulator (FASS) manufactured by Hewlett-Packard that
is used to encode the transmitted FM wave form of the Ava transmitter. The backscattered signal is decoded using
matched  filter  techniques.  The  DSA  receiving  system  consists  of  six  Watkins-Johnson  receivers  which  interface 
with the VXI bus and are controlled by an embedded 80486 DOS-based computer. The quadrature output of the
receivers is digitized under the control of a 68020 CPU (operating under OS-9) which hosts three DSP boards each
containing three TMS320C30 CPUs on a VME bus. Communication between the CPUs is effected with Ethernet.
The implementation of the QPC wave form was carried out by DSA under contract to RL and the first experimental
field  tests  were  conducted  in  June  and  July  of  1994  using  in-house  RL  facilities. 
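
The way a quadratic phase code moves range-folded clutter in Doppler can be illustrated with a small simulation. The sketch below is not the DSA implementation, whose details are not given in this report. It assumes a hypothetical sweep-to-sweep phase of pi n^2 / 10 radians, chosen only because it produces a Doppler offset of WRF/10 per range-ambiguity window, and it models zero-Doppler clutter with unit amplitude.

```python
import numpy as np


def code_phase(sweep_index, code_period=10):
    """Assumed quadratic phase (radians) applied to a given sweep."""
    return np.pi * np.asarray(sweep_index, dtype=float) ** 2 / code_period


def peak_doppler_hz(window_index, n_sweeps=128, wrf_hz=48.828, code_period=10):
    """Doppler bin where zero-Doppler clutter from a range window ends up."""
    n = np.arange(n_sweeps)
    # An echo from ambiguity window m received during sweep n left on sweep n - m.
    received = np.exp(1j * code_phase(n - window_index, code_period))
    # Decoding removes the phase of the sweep currently being received.
    decoded = received * np.exp(-1j * code_phase(n, code_period))
    spectrum = np.abs(np.fft.fft(decoded))
    freqs = np.fft.fftfreq(n_sweeps, d=1.0 / wrf_hz)
    return float(freqs[np.argmax(spectrum)])


if __name__ == "__main__":
    for m in range(4):
        print(m, round(peak_doppler_hz(m), 2))
```

After decoding, the residual phase of an echo from window m is linear in the sweep index, so the clutter appears at a Doppler offset of about -m x WRF/10 (to within one Doppler bin), while echoes from the first window stay at 0 Hz. Reversing the sign of the assumed code would move the folded clutter to positive Doppler instead.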

The  ambiguity  function  is  defined  as  the  absolute  value  of  the  envelope  of  the  output  of  a  matched  filter 
when  the  input  to  the  filter  is  a  Doppler-shifted  version  of  the  original  signal,  to  which  the  filter  was  matched 
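[Levanon, 1988]. If the complex envelope of the signal is $u(t)$, the ambiguity function can be expressed by

$$|\chi(\tau,\nu)| = \left| \int_{-\infty}^{\infty} u(t)\, u^{*}(t-\tau)\, \exp(j 2\pi \nu t)\, dt \right|$$

where $\tau$ is a time delay and $\nu$ is the Doppler shift. A linear FM pulse has a complex envelope which is written

$$u(t) = \frac{1}{\sqrt{t_p}}\, \mathrm{rect}\!\left(\frac{t}{t_p}\right) \exp(j\pi k t^{2}),$$

where $t_p$ denotes the pulse duration. By replacing $\nu$ with $\nu + k\tau$, the ambiguity function of the linear FM pulse can
be written

$$|\chi(\tau,\nu)| = \left| \left(1 - \frac{|\tau|}{t_p}\right) \frac{\sin[\pi t_p (1 - |\tau|/t_p)(\nu + k\tau)]}{\pi t_p (1 - |\tau|/t_p)(\nu + k\tau)} \right|, \qquad |\tau| \le t_p.$$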


The  ambiguity  functions  for  a  single  pulse  and  pulse  trains  are  discussed  in  detail  in  the  monograph  by  Rihaczek 
[1977].  Detailed  modeling  of  the  ambiguity  function  for  the  QPC  has  been  carried  out  in  MATLAB®  by  DSA 
[Clancy,  private  communication]. 
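
As a check on the closed-form expression for the linear FM pulse, the ambiguity integral can be evaluated numerically and compared with it. The sketch below is independent of the DSA MATLAB modeling mentioned above. It assumes a unit-energy envelope, treats k as the FM sweep rate, and uses normalized illustrative values (t_p = 1, k = 20) rather than the radar parameters.

```python
import numpy as np


def lfm_envelope(t, tp, k):
    """Unit-energy linear FM complex envelope of duration tp and sweep rate k."""
    inside = np.abs(t) <= tp / 2.0
    return inside * np.exp(1j * np.pi * k * t ** 2) / np.sqrt(tp)


def ambiguity_numeric(tau, nu, tp, k, n=40001):
    """|chi(tau, nu)| by a Riemann sum of u(t) u*(t - tau) exp(j 2 pi nu t)."""
    t = np.linspace(-tp, tp, n)
    dt = t[1] - t[0]
    integrand = (lfm_envelope(t, tp, k)
                 * np.conj(lfm_envelope(t - tau, tp, k))
                 * np.exp(2j * np.pi * nu * t))
    return float(abs(np.sum(integrand) * dt))


def ambiguity_closed_form(tau, nu, tp, k):
    """Closed-form |chi(tau, nu)| of the linear FM pulse, valid for |tau| <= tp."""
    shrink = 1.0 - abs(tau) / tp
    # np.sinc(x) is sin(pi x) / (pi x), matching the expression in the text.
    return float(abs(shrink * np.sinc(tp * shrink * (nu + k * tau))))


if __name__ == "__main__":
    for tau, nu in [(0.0, 0.0), (0.25, -5.0), (0.1, 0.0)]:
        print(tau, nu, ambiguity_numeric(tau, nu, 1.0, 20.0),
              ambiguity_closed_form(tau, nu, 1.0, 20.0))
```

Along the ridge where the Doppler shift equals -k times the delay, the sinc factor is unity and the magnitude falls off linearly as 1 - |tau|/t_p, which is the sheared ridge characteristic of linear FM.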

Experimental  Program 

A series of tests was begun June 28, 1994 utilizing the RL Verona receiving facility and the Ava transmitter
site. Due to software problems in the receiving hardware, no measurements were acquired on either June 28 or
June 29. However, these initial problems were overcome and data were recorded during the evening hours of June
30. Additional measurements were made during three nights beginning July 11, 1994. During this series of tests,
three different wave forms were tested: 1) a WRF of 48.8 Hz using the QPC to effect both positive and negative offset;
2) a WRF of 48.8 Hz with no coding; and 3) a WRF of 10 Hz with no coding. Details of the wave forms are
given in Table 1. Transmissions at several frequencies were used, dependent on the propagation conditions and the
channel occupancy; the signal bandwidth was 4 kHz and a transmitted power of 100 kW was used. A rhombic
antenna (azimuthal HPBW ≈7.5°) directed at 184° from true north was used for transmitting; a Beverage array having
a 10° HPBW was employed at the Verona site for receiving. This array has 12 elements, not all of which were functional
during the field tests.

WRF (Hz)                   48.828       48.828       10
Wave form                  LFM + QPC    LFM          LFM
Doppler Resolution (Hz)    0.375        0.375        0.5
Unambiguous Range (km)     3072         3072         15,000
Number of range bins       128          128          625

Table 1. Details of transmitted wave forms used during the July 1994 Ava-Verona experiments.

For  the  purposes  of  this  report,  only  data  acquired  on  July  14/15,  1994  will  be  discussed.  The  data  shown 
here were acquired over an interval of ≈30 min, between 0500 UT and 0530 UT on 15 July 1994. Figure 1 shows a
Doppler-range  plot  where  the  unambiguous  range  is  15,000  km  (wave  form  3)  and  the  number  of  samples  was  20 
(Doppler  resolution  =  0.5  Hz).  Three  distinct  regions  of  backscatter  echoes  are  present  at  ranges  of  1824-3400  km, 
4250-6600  km  and  8780-9400  km.  The  same  data  are  shown  in  a  three-dimensional  format  in  Figure  2.  In  Figure 
3,  the  WRF  was  increased  to  48.828  Hz  (unambiguous  range  of  3072  km)  and  128  samples  were  digitized  (wave 
form  2).  In  this  case,  range  aliasing  occurs  and  clutter  (backscatter  echoes)  from  longer  ranges  is  folded  over  the 
whole  range  window.  As  a  result  of  the  higher  WRF,  the  spectra  shown  in  Figures  3,  4,  5  and  6  cover  a  frequency 
range  of  ±24  Hz.  The  results  shown  in  Figure  4  and  5  were  obtained  when  QPC  coding  was  applied  to  the  FMCW 
signal using a WRF of 48.828 Hz (wave form 1). In these Figures, the first range window (0-3072 km) is centered
at 0 Hz; echoes from longer ranges appear at negative Doppler offsets that are integer multiples of 48.828 Hz/10. For
example, echoes between 3073 and 6144 km are centered at -4.8828 Hz. Taking into account the Doppler frequency
at which the echoes occur, these data can be compared with the unambiguous backscatter data shown in Figure 1.
Doing  so  indicates  that  backscatter  echoes  occur  between  2470-3650  km,  4700-7200  km  and  8850-10,800  km. 
These  ranges  are  in  general  agreement  with  the  three  groups  of  echoes  observed  using  a  WRF  of  10  Hz;  any 
disagreement  can  presumably  be  accounted  for  by  the  fact  that  the  two  soundings  were  acquired  30  minutes  apart, 
during  which  time  the  height  of  the  F2  region  increased.  Clutter  introduced  by  echoes  from  what  is  possibly 
ionization  created  by  a  meteor  can  be  seen  at  a  slant  range  of  500  km.  In  each  of  the  three  examples  discussed  here, 
the  transmissions  occurred  at  16.5  MHz  at  a  power  level  of  100  kW  and  a  bandwidth  of  4  kHz. 

The  Doppler  spectra  derived  from  the  data  shown  in  Figure  1  (10  Hz  WRF)  are  presented  in  Figure  6  and 
those from Figure 4 (48.8 Hz WRF) in Figure 7. Spectra from 12 successive range gates at 24 km intervals, starting
with the closest range in the upper left-hand corner and continuing to the farthest of the 12 range gates in the
lower right-hand corner, are depicted. In panel (a) of Figure 6, data from range gates between 1920 and 2184 km are
shown. These data have a relatively narrow spectral peak (~2 Hz) which shifts from positive to slightly negative
Doppler values with increasing range. The spectra from the second hop ground backscatter (i.e. 4824-5088 and
6024-6288 km) exhibit approximately the same Doppler width and the peak also moves from positive to negative
Doppler values. Although the clutter detected at ranges >9000 km has a somewhat broader spectrum, none of these
data are suggestive of the spread Doppler clutter observed in sector 7 by the ROTHR system during the January 1994
SDC campaign [Boak, 1994; Rastogi and Providakes, 1994].

In Figure 7, spectra acquired when the QPC wave form was transmitted are shown. These spectra correspond
to range gates between (a) 2112 and 2376 km, (b) 2616 and 2880 km and (c) 144 and 408 km.
The spectra in Fig. 7(a) are dominated by a single peak near -5 Hz and a weak secondary maximum near -10 Hz. A
peak near 0 Hz develops at ranges beyond ≈2200 km. The Doppler offset of ≈5 Hz indicates that the dominant peak
occurs at a range of >5200 km. Two spectral peaks occur in Fig. 7(b), the largest near 0 Hz and a secondary peak at
≈-4.8 Hz, indicative of echoes at ≈2600 km and ≈5700 km. In Figure 7(c), showing spectra between 144 and 408 km,
four distinct peaks occur corresponding to echoes at ranges of 144, 3216, 6288 and 9360 km. The echo at 144 km is
the  ground  wave  signal  from  the  Ava  transmitter. 
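
The conversion from an apparent range gate and a Doppler offset back to a true slant range, as used in the interpretation of Figure 7, can be sketched as follows. The function is hypothetical. It assumes that each successive range-ambiguity window is offset by a further WRF/10 toward negative Doppler, and that the Doppler shift of the echo itself is small compared with that step.

```python
def unfold_range_km(apparent_range_km, doppler_offset_hz,
                    wrf_hz=48.828, window_km=3072.0, code_period=10):
    """Recover the true slant range of an echo from its QPC Doppler offset."""
    step_hz = wrf_hz / code_period  # Doppler offset per range-ambiguity window
    window_index = round(-doppler_offset_hz / step_hz)
    return apparent_range_km + window_index * window_km


if __name__ == "__main__":
    # Four peaks observed in the 144 km range gate (offsets are approximate).
    for offset in (0.0, -4.9, -9.8, -14.6):
        print(offset, unfold_range_km(144.0, offset))
```

For the 144 km gate, peaks near 0, -4.9, -9.8 and -14.6 Hz unfold to 144, 3216, 6288 and 9360 km, the four ranges quoted above.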

Ionospheric  ray  tracing  programs  can  be  used  as  a  tool  to  predict  or  model  HF  propagation  paths.  One  such 
program  [Argo  et  al,  1990]  was  implemented  by  the  author  on  a  Sun  Workstation  at  RL  and  employed  to  model  the 
path  of  a  16.5  MHz  radio  wave  transmitted  from  the  Ava  site.  On  July  15,  the  effective  sunspot  number  was  35  and 
the geomagnetic field was at minor storm levels (Kp=4+) at the time of the backscatter soundings. Results from the
LANL  Tracker  program  are  shown  in  Figure  8a,b  where  a  path  due  south  from  the  Ava  field  site  (43.5°N,  75.1°W) 
was specified. For a sunspot number of 35, no HF propagation at 16.5 MHz is sustainable (panel a); when the F-region
density is increased using a sunspot number of 100, rays with elevation angles of 4-7° are propagated. The
entire  24  hr  interval  (at  two  hour  increments)  has  been  simulated,  the  results  of  which  are  shown  in  Appendix  A  (not 
included  in  this  final  report).  At  local  noon  (16  UT),  the  E-region  is  present  and  nearly  all  the  low  angle  rays  are 
propagated  by  reflection  from  that  region,  although  some  higher  angle  rays  undergo  reflection  from  the  F-layer.  The 
first  hop  Earth  reflection  occurs  approximately  1600  km  down  range.  The  propagation  conditions  do  not  change 
drastically until 1800 LT, when ionization in the E-region decreases and F-layer reflections begin to dominate. By
2000 LT, all propagation occurs via the F-region and the first hop Earth reflection is at a range of ≈2000 km. As
the F-region peak increases in height, the distance at which the first Earth reflection occurs also increases, and is
≈2500 km down range at 0530 UT. This result is generally consistent with the observed backscatter ionograms.
The E-region begins to form again at 0400 LT and short range propagation is possible at some elevation angles. At
this time, many of the rays propagate through the ionosphere and there is a time near 0600 LT when only long distance
propagation can occur. Presumably, ionospheric tilts associated with sunrise are responsible for this chaotic
interval of propagation. As the Sun ionizes the E-region, it again becomes the dominant layer and sustains most of
the  low  angle  propagation. 
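
The dependence of the first-hop distance on reflection height can be illustrated with a simple mirror model, in which the ray travels in a straight line to a reflecting shell at a fixed virtual height above a spherical Earth. This is a geometric sketch only and is not the LANL Tracker program: it ignores refraction within the layer and ionospheric tilts, and the heights and elevation angle used below are illustrative assumptions.

```python
import math

EARTH_RADIUS_KM = 6371.0


def one_hop_ground_range_km(virtual_height_km, elevation_deg):
    """Ground distance of one hop for mirror reflection at a virtual height."""
    delta = math.radians(elevation_deg)
    ratio = (EARTH_RADIUS_KM * math.cos(delta)
             / (EARTH_RADIUS_KM + virtual_height_km))
    # Earth-centre angle between the transmitter and the reflection point.
    half_angle = math.acos(ratio) - delta
    return 2.0 * EARTH_RADIUS_KM * half_angle


if __name__ == "__main__":
    for height_km in (110.0, 250.0, 300.0):
        print(height_km, one_hop_ground_range_km(height_km, 5.0))
```

For a 5° elevation angle, an assumed E-region height of 110 km gives a first hop of roughly 1500 km, and raising the reflection height into the F-region lengthens the hop to well beyond 2000 km, in line with the trend described above.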

Conclusions 

The  initial  series  of  field  experiments  conducted  at  RL  has  demonstrated  that  the  QPC  wave  form  can  be 
successfully  applied  to  OTH  backscatter  radar  systems,  shifting  the  range-folded  backscatter  echoes  to  higher  or  lower 
frequencies  in  Doppler  space.  What  has  not  yet  been  demonstrated  is  the  application  of  this  method  under  conditions 
when  equatorial  clutter  was  known  to  have  occurred.  In  this  context,  it  should  be  noted  that  no  prior  experiments 
have  demonstrated  that  SDC  from  the  equatorial  ionosphere  can  be  detected  using  the  Ava-Verona  facilities.  It  is 
certainly  possible  that  the  relatively  small  receiving  array  at  Verona  does  not  have  enough  gain  to  detect  SDC,  which 
is  often  20-30  dB  weaker  than  the  normal  ground  clutter.  Certainly,  none  of  the  spectra  acquired  thus  far  exhibit  the 
broad Doppler frequency extent that has been reported by operational OTH radar systems [Boak, 1994; Rastogi and
Providakes, 1994; Coleman and Ward, 1994]. Backscattered echoes from the Ava transmitter should undergo
Doppler broadening at ranges between ≈4300 and 7600 km, which more-or-less coincides with the second group of
echoes in Figure 1. In addition, the performance of the wave form in an operational (i.e., tracking) mode has also not
been  studied  and  the  question  of  how  targets  might  be  aliased  in  Doppler  space  remains  open. 

Future  experiments  ought  to  focus  on  the  following  aspects: 

1) Apply only the uncoded 10 Hz WRF and the coded 48.8 Hz WRF with short time intervals (≈ 5 min) between
the two formats;

2)  Test  codes  that  enable  larger  frequency  offsets  in  Doppler  space; 

3) Utilize longer coherent integration times to obtain sea backscatter spectra so as to verify system performance;

4) Test higher WRFs in the range 60 to 100 Hz [Clancy, private communication];

5)  Conduct  extended  operations  to  acquire  data  during  the  day-night  and  night-day  transition  in  the  equatorial 
region; 

6)  Document  the  occurrence  of  spread  Doppler  clutter  in  the  appropriate  ROTHR  beam; 

7)  Acquire  time-contiguous  data  to  demonstrate  tracking  capability; 

8)  Incorporate  improvements  to  the  receiving  antenna  array,  such  as  modifying  the  VALAR  array  to  look 
south  instead  of  north; 

9)  Utilize  additional  receivers,  perhaps  for  impulsive  noise  reduction  (thus  far,  only  one  of  six  has  been  used); 

10) Employ more modern spectral analysis techniques in the data analysis process [e.g. Bourdillion et al, 1987;
Cooley et al, 1987].

[Figures 4, 6(d), 7(a), 7(b) and 7(c): images not reproduced.]

A STUDY OF THE APPLICABILITY OF FRACTALS AND KINETIC EQUATIONS TO
ELECTROMIGRATION AND THERMALLY INDUCED HILLOCK AND VOID EVOLUTION


Martin  Villarica,  Graduate  Research  Assistant 
Joseph  Chaiken*,  Associate  Professor 
Department  of  Chemistry 
Syracuse  University 


Abstract 


The growth of hillocks and voids in metal films was studied. The applicability of a model
involving fractals and kinetic equations was examined on the basis of whether there is
independent justification for using scaling arguments in the model and whether there is
reason to connect the evolution of hillocks with that of voids. Hillocks and voids were found to be
self-similar across about three orders of magnitude of variation in spatial scale with the same
fractal dimension. Voids and hillocks were found to have the same fractal dimension whether
studied  using  atomic  force  microscopy  (AFM)  or  scanning  electron  microscopy  (SEM).  The 
parameters  obtained  from  these  fractal  analyses  demonstrate  quantitative  internal  consistency 
with  an  earlier  time  dependent  study  of  thermal  annealing  effects  on  hillock  distributions. 
Remarkably,  area-perimeter  data  obtained  from  either  a  long-time  study  of  a  single  void  or  a 
spatial  average  of  a  large  number  of  different  voids  both  yield  quantitatively  identical  results. 

A STUDY OF THE APPLICABILITY OF FRACTALS AND KINETIC EQUATIONS TO ELECTROMIGRATION
AND  THERMALLY  INDUCED  HILLOCK  AND  VOID  EVOLUTION 


Martin  Villarica  and  Joseph  Chaiken* 


Introduction 

We  recently  reported^  the  very  encouraging  results  of  our  first  attempt  at  applying  a 
kinetic  model^  to  the  time  evolution  of  the  growth  of  hillocks  in  metal  films.  These  results  raise 
various  questions  requiring  satisfactory  exploration  before  we  can  confidently  form  conclusions 
regarding  any  possible  relationship  between  the  assumptions  and  parameters  of  the  model  and 
actual  physical  structures  and  interactions.  Fundamental  to  the  model  are  assumptions  and 
parameters  derived  from  the  scaling  behavior  of  the  microscopic  parameters  describing  the 
entities  responsible  for  hillock  and  void  growth.  Additional  relationships  between  the  spatial, 
temporal  and  mass  scaling  of  the  equations  describing  the  interactions  between  those  entities 
also  require  exploration.  In  what  follows  we  describe  our  attempt  to  obtain  independent  evidence 
justifying  the  scaling  assumptions  of  the  model  and  their  application  to  electromigration. 

First,  we  will  briefly  review  the  model  and  the  results  of  our  application  of  the  model  to 
data  obtained  independently  by  Vook  and  coworkers^>'^.  Naturally,  the  questions  we  hoped  to 
address  this  summer  are  articulated  in  the  succeeding  section.  We  then  present  the  results  of 
atomic force microscopy (AFM) and scanning electron microscopy (SEM) experiments on
aluminum films which provide some answers, or at least insights, into these questions. It should
be clearly noted from the outset that what we seek is a general model for the rearrangement of thin
film matter subjected to thermal and/or electromigration stress. Consequently, much work will be
needed to establish the reality of the physical picture suggested by any model. We believe the
potential  empirical  use  of  the  model  for  reliability  and  failure  prediction  will  be  clear  in  any 
case.  Whereas  the  log  normal  distribution  has  been  applied  to  electromigration  in  various  ways, 
our  model  employs  the  Poisson  distribution  which  usually  fits  experimental  data  equally  well  or 
better.  Our  model  assigns  physically  meaningful  quantities  to  the  microscopic  parameters  of  the 
Poisson  distribution  which,  at  worst,  could  be  the  basis  for  a  semi-empirical  parameterization. 
The  log  normal  distribution  cannot  offer  the  same  opportunity  for  connection  with  microscopic 
causes  and  effects. 
The Model

The model has been described in detail in the literature^-^ and is based on the
Smoluchowski equations (1) with Jullien's asymptotic solution® (2) giving the time dependent
concentration, $n_k$, of a cluster, hillock or some other association or entity having k monomer
units, i.e. a k-mer. The model assumes that k-mers are only produced by coalescence of smaller
clusters  and  are  only  destroyed  when  they  coalesce  with  another  cluster  to  form  a  still  larger 
cluster.  If  there  was  a  population  of  monomers  at  t=0,  say  metal  atoms  impinging  on  a  substrate 
from  the  gas  phase,  the  formation  of  a  film  would  commence  by  the  formation  of  associations 
between  the  atoms.  These  "associations"  might  be  thought  of  as  islands,  hillocks  or  some  other 
name  depending  on  their  specific  properties. 
$$\frac{dn_k}{dt} = \frac{1}{2}\sum_{j=1}^{k-1} K_{j,k-j}\, n_j\, n_{k-j} \;-\; n_k \sum_{j=1}^{\infty} K_{j,k}\, n_j \qquad (1)$$

$$n_k \propto k^{a}\, e^{-bk} \qquad (2)$$
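The coalescence-only kinetics of (1) can be explored numerically. The following is a minimal sketch, not the authors' procedure: it integrates (1) with an explicit Euler step, truncates the size range at a hypothetical `kmax`, and assumes a constant kernel $K_{ij} = 1$ purely because that case has a known closed-form solution to check against. The function names are illustrative.

```python
def smoluchowski_step(n, kernel, dt):
    """Advance the k-mer concentrations n[1..kmax] by one Euler step of (1)."""
    kmax = len(n) - 1  # n[0] is unused so that n[k] is the k-mer concentration
    new = n[:]
    for k in range(1, kmax + 1):
        # Gain: a j-mer and a (k-j)-mer coalesce into a k-mer.
        gain = 0.5 * sum(kernel(j, k - j) * n[j] * n[k - j] for j in range(1, k))
        # Loss: the k-mer coalesces with any other cluster.
        loss = n[k] * sum(kernel(j, k) * n[j] for j in range(1, kmax + 1))
        new[k] = n[k] + dt * (gain - loss)
    return new


def evolve(kmax=50, steps=400, dt=0.01, kernel=lambda i, j: 1.0):
    """Start from monomers only and integrate to t = steps * dt."""
    n = [0.0] * (kmax + 1)
    n[1] = 1.0
    for _ in range(steps):
        n = smoluchowski_step(n, kernel, dt)
    return n


if __name__ == "__main__":
    n = evolve()
    total_mass = sum(k * n[k] for k in range(1, len(n)))
    print("total monomer count:", total_mass)
    print("monomer concentration at t = 4:", n[1])
```

For the constant kernel the monomer concentration at time t is $(1 + t/2)^{-2}$, so the value at t = 4 should be close to 1/9, and the total number of monomers should stay essentially constant, in line with the model's assumption of a fixed monomer inventory.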


Although  this  model  seems  to  contain  much  of  the  physical  nature  of  the  hillock/void 
formation  and  growth  process,  realistically,  we  know  there  are  at  least  a  few  potential 
processes  which  could  lead  to  fragmentation  of  some  islands  or  clusters.  Although  both 
unimolecular^  and  bimolecular  processes®  can  be  imagined,  we  will  group  them  all  together 
under  the  heading  of  "evaporation".  There  is  no  explicit  term  for  evaporation  at  this  point 
except  to  note  that  the  rate  constants  used  in  (1)  implicitly  relate  the  fact  that  not  all 
encounters  between  i-mers  and  j-mers  result  in  coalescence.  The  greater  the  probability  of 
evaporation  occurring  on  some  time  scale  related  to  the  averaged  time  before  the  next  collision 
occurs, the smaller the cross sections or rate constants used in (1). Since in this model the
presence of a buffer system, e.g. buffer gas in gas phase systems or codeposition gas in matrix
isolation, does not appear explicitly, these rate constants contain this information implicitly
when the equations are fit to real data. The asymptotic solution, (2), can be interpreted in
terms of the scaling of the bimolecular rate constants, $K_{ij}$, relating the interaction of
individual agglomerating entities, e.g. i-mers and j-mers, with increasing numbers of
monomers in the clusters. Under certain situations®, it is thought that the rate constants could
scale smoothly with increasing number of monomer units, and if (3) is satisfied they are
termed "homogeneous".
$$K_{\lambda i,\,\lambda j} = \lambda^{2\omega}\, K_{ij} \qquad (3)$$


The  most  empirical  mathematical  implications  of  homogeneity  and  scaling  are  best  summarized 
by noting that they allow the rate constants, the $K_{ij}$, to be written explicitly in terms of i and j.
This allows analysis of the equations and the possibility of discovering rigorous closed form
mathematical connections between microscopic parameters defining the $K_{ij}$ and actual
measurements  of  hillock  or  cluster  size  distributions. 
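As an illustration of what "written explicitly in terms of i and j" can mean, the sketch below uses a hypothetical product kernel $K_{ij} = (ij)^{\omega}$ and checks numerically that it scales as $K_{\lambda i, \lambda j} = \lambda^{2\omega} K_{ij}$, which is the usual form of the homogeneity condition and is assumed here. The kernel choice and the function names are ours, not taken from the model's literature.

```python
import math


def product_kernel(i, j, omega):
    """Hypothetical homogeneous kernel K_ij = (i*j)**omega, of degree 2*omega."""
    return (i * j) ** omega


def homogeneity_degree(kernel, i, j, lam):
    """Estimate x in K(lam*i, lam*j) = lam**x * K(i, j) for one choice of i, j, lam."""
    ratio = kernel(lam * i, lam * j) / kernel(i, j)
    return math.log(ratio) / math.log(lam)


if __name__ == "__main__":
    omega = 0.3
    kernel = lambda i, j: product_kernel(i, j, omega)
    # The estimated degree should equal 2*omega for any i, j and lam > 1.
    print(homogeneity_degree(kernel, 2, 5, 3))
```

A kernel with a preferred size (a "magic number" cluster, for instance) would give different degrees for different i and j, which is one way to see what homogeneity rules out.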

Although we shall choose not to do so in this work, it is possible’’ ° to factor the $K_{ij}$ into
a product of two functions. The first involves only those factors which calculate the probability
of a hard sphere (or other generic shape) encounter. The second is a probability factor relating
the likelihood of an encounter ending in a certain action, e.g. coalescence. The first factor is
essentially the collision frequency, i.e. related to overall mass transfer conditions, while the
second contains information related to appropriately averaged trajectories and so contains the
chemistry of each reaction system in the agglomeration process. For very small clusters and
entities, the observation of quantum effects is expected in the form of discontinuities in the
variation of the $K_{ij}$ with size.

In the absence of the above factoring of the $K_{ij}$, the model's results, e.g. a distribution of
cluster  sizes  produced  in  the  long  time  limit,  show  what  would  happen  if  the  distribution 
function  was  determined  purely  by  kinetics.  These  kinetics  would  be  determined  entirely  by 
mass  transfer  as  if  there  were  no  preferred  pathways  for  the  formation  or  destruction  of  any 
size  satisfying  the  homogeneity  condition.  The  Stockmayer  formula^  ^  from  polymer  chemistry 
relating  the  number  of  independent  kinetic  pathways  producing  a  given  size  cluster  to  the 
number  of  monomers  in  that  product  cluster  plays  a  prominent  role  in  the  rigorous  proof  of  the 
validity  of  (2).  We  assume  that  neither  the  total  number  of  monomers,  whether  free  or  in 
clusters, nor the temperature, varies with time. The model is a so-called "mean value" model in
that  there  is  assumed  to  be  no  spatial  correlation  between  the  locations  of  the  various  reactants 
and  products. 

These  scaling  laws®  are  consistent  with  specific  limits  of  attainable  reaction  conditions. 
They  are  based  on  rigorous  calculation  of  things  like  how  the  mass  of  a  cluster  increases  with 
increasing  number  of  monomer  units  while  the  velocity  of  the  clusters  decreases.  The  scaling  of 
mass  transfer  and  energy  transfer,  in  addition  to  reaction  rates,  depends  on  such  quantities.  The 
scaling  of  surface  area  of  a  cluster  with  increasing  number  of  monomers  is  another  such 
quantity.  The  distribution  of  the  mass  of  the  clusters  with  increasing  monomer  size  is  yet 
another. Assumptions of Jullien®>® and others applying the Smoluchowski equation embrace
specific  ways  of  mathematically  representing  these  quantities  so  as  to  make  the  equations 
describe  systems  which  are  "self-similar"  across  spatial  and  temporal  scales.  We  shall  define 
the  term  "self-similar"  in  this  connection  more  precisely  a  little  later. 

We  assert  that  the  more  often  a  given  cluster  encounters  other  clusters,  the  more 
rapidly  it  coalesces  with  other  clusters.  As  a  cluster  gets  larger,  it  becomes  more  massive, 
therefore  moving  more  slowly,  thereby  exploring  space  more  slowly  and  encountering  fewer 
potential  coalescence  partners  per  unit  time.  The  result  is  a  slower  growth  rate.  Balanced 
against  this  effect  is  the  fact  that  as  monomers  or  other  size  clusters  are  added  to  a  cluster,  it 
must  get  larger  so  its  cross  section  increases.  Because  of  this,  more  space  is  explored  per  unit 
time and so a faster growth rate obtains. The overall effect depends on whether clusters
become spatially larger faster than they become slower due to becoming more massive. The
validity or importance of this physical picture in determining size distributions and other
observable characteristics of coalescence growth systems is one of the fundamental questions
regarding  the  entities  responsible  for  hillock  and  void  growth  mentioned  in  the  Introduction. 

It  is  possible  to  show®  that  these  effects,  as  well  as  the  scaling  of  concentrations  with 
varying  spatial  scales,  are  connected  by  (4).  This  equation  can  be  shown  valid  in  a  variety  of 
commonly  attainable  sets  of  reaction  conditions. 

$$2\omega = \alpha + (d - d_w)/D \qquad (4)$$

(4) is obtained by considering the relationship between the spatial, temporal and mass scaling
of the Smoluchowski equations. Mathematically, one considers a microscopic collision frequency
or rate constant and the effect of increasing (or decreasing) each length which enters into the
microscopic calculation of the rate constant or into the expression of the concentrations. Calculating the
reciprocal temporal scale change needed to make the terms of the Smoluchowski equations
invariant to the net effect of the two scale changes, one encounters (4) as a necessary condition.
We do not know whether other forms of the kernels exist which also produce self-similar
distributions. What is important for this effort is that Jullien's solution does®.

The first parameter, $\alpha$, arises from the scaling of velocity with increasing cluster
mass. It will usually be negative. For an isothermal system at thermodynamic equilibrium,
heavy objects move more slowly than light ones.

$$v_{rms} = \sqrt{\frac{3 k_B T}{m}} \qquad (5)$$

(5) is simply the root mean square speed of an ideal gas molecule of mass m at temperature T,
with $k_B$ the Boltzmann constant. $v_{rms}$ obviously scales as $m^{-1/2}$, so when the mass transfer
is limited by the molecular speeds and not the concentration gradient, i.e. in the ballistic regime
and not the diffusive, $\alpha = -0.5$. The
parameter $d_w$, the "fractal dimensionality of the cluster trajectories", results from the scaling
of the mean square displacement between collisions with increasing cluster mass. When motion
is diffusive, $d_w = 2$; when ballistic, $d_w = 1$. The dimensionality of the space in which the
coalescence occurs is d. For clusters formed in bulk gases, d=3. For islands formed on single
crystal surfaces by interaction of adsorbates diffusing along the surface, d=2. The fractal
dimensionality of the clusters, defined in terms of a mass fractal, is D. These parameters define,
and provide quantitation of, energy and mass transfer properties of the reacting system and
chemical properties of the clusters.
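A short numerical sketch may help fix the roles of the parameters in (4) and (5). It assumes the relation $2\omega = \alpha + (d - d_w)/D$, with $\alpha$ the velocity scaling exponent, and the ideal gas result $v_{rms} = \sqrt{3 k_B T / m}$. The function names and the example values (compact clusters with D = 3 in a bulk gas) are illustrative choices, not values from this study.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K


def v_rms(temperature, mass):
    """Root mean square speed of an ideal gas particle, as in (5)."""
    return math.sqrt(3.0 * K_B * temperature / mass)


def two_omega(alpha, d, d_w, D):
    """Degree of homogeneity of the kernel from the scaling relation (4)."""
    return alpha + (d - d_w) / D


if __name__ == "__main__":
    # Quadrupling the mass halves the rms speed, i.e. v_rms scales as m**(-1/2).
    print(v_rms(300.0, 4.0e-26) / v_rms(300.0, 1.0e-26))
    # Ballistic motion (alpha = -0.5, d_w = 1) of compact clusters (D = 3) in d = 3.
    print(two_omega(alpha=-0.5, d=3, d_w=1, D=3))
```

With these example inputs the kernel degree is 1/6. Changing d to 2 and d_w to 2 shows how strongly the result depends on the transport regime and the dimensionality of the space.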

When  the  shape  of  an  experimentally  produced  cluster  size  distribution  can  be 
quantitatively  calculated,  based  on  scaling  properties  and  the  assumption  that  the  kernels  are 
homogeneous,  we  accomplish  an  enormous  decrease  in  the  number  of  variables  needed  to  model 
the  system.  There  are  physical  implications  as  well.  For  cluster  or  hillock  sizes  corresponding 
to  homogeneous  kernels,  there  are  no  special  structures  or  interactions  that  occur  as  the 
clusters  become  larger.  Alternatively,  it  may  also  be  true  that  a  given  sized  cluster  occurs  in 
various  equivalent  structures.  This  is  consistent  with  the  idea  that  the  Smoluchowski  equations 
produce self-similar cluster size distributions. It has been stated^ 2 that surfaces produced
from  distributions  of  self-similar  hillocks  and  voids  are  self-similar,  i.e.  fractal. 

Application  to  Vook's  Data 

Vook  and  coworkers  allowed  us  access  to  several  hillock  size  distributions  that  they 
produced  using  evaporated  aluminum  and  aluminum-copper  films  on  silicon  substrates. 
Obtained  using  SEM,  these  distributions  corresponded  to  a  nominal  magnification  of  10,000x. 
The  films  systematically  surveyed  a  range  of  deposition  conditions,  i.e.  substrate  temperature 
and  deposition  rate.  The  data  were  well  fit  using  any  of  a  few  different  distributions.  Overall,  the 
Poisson  distribution  usually  fit  best  for  pure  aluminum  films  while  alloys  were  often  fit 
slightly  better  by  a  log  normal  distribution.  Based  on  the  quality  of  the  fits  it  could  certainly  be 
said  that  homogeneous  kernels  could  correspond  to  the  physical  situation  represented  by  these 
films. Vook approximated the hillocks as non-overlapping hemispheres; his raw data consisted of
lists  of  diameters. 
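One simple way to fit a size distribution of the form $n_k = C k^{a} e^{-bk}$ is to note that $\ln n_k = \ln C + a \ln k - b k$ is linear in the three unknowns. The sketch below does this by ordinary least squares in pure Python. It is an assumption-laden illustration, not the fitting procedure used for Vook's data: it presumes that sizes have already been converted from diameters to a monomer count k, that every bin has a nonzero count, and that unweighted fitting in log space is acceptable. All names are hypothetical.

```python
import math


def solve_linear(A, y):
    """Solve A x = y by Gaussian elimination with partial pivoting."""
    n = len(y)
    M = [row[:] + [rhs] for row, rhs in zip(A, y)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(M[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (M[r][n] - s) / M[r][r]
    return x


def fit_size_distribution(sizes, counts):
    """Least-squares fit of ln n_k = ln C + a ln k - b k; returns (C, a, b)."""
    rows = [(1.0, math.log(k), -float(k)) for k in sizes]
    targets = [math.log(n) for n in counts]
    # Normal equations (X^T X) p = X^T y for p = (ln C, a, b).
    A = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    y = [sum(r[i] * t for r, t in zip(rows, targets)) for i in range(3)]
    ln_c, a, b = solve_linear(A, y)
    return math.exp(ln_c), a, b


if __name__ == "__main__":
    sizes = list(range(1, 41))
    counts = [100.0 * k ** -0.25 * math.exp(-0.1 * k) for k in sizes]  # synthetic
    print(fit_size_distribution(sizes, counts))
```

Because the power-law factor matters most at small k, truncating the smallest sizes from `sizes` and `counts` is a quick way to see how limited resolution degrades the estimate of a while leaving b comparatively stable.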

Vook's data suggested that SEM at 10,000x magnification is not adequate to discern the
smallest and most mobile participants in the coalescence process. The distributions therefore
poorly determined the value of a that could be obtained from the fits to (2). The value
of b from (2) should be reasonably well determined. In fact, the b parameter is valuable
because a plot of ln(b) vs ln(time) for the time evolution of a given system should be linear
with slope $1/(2\omega - 1)$. Since $a = -2\omega$, having such data allows a test of internal consistency for
the application of the model to a given system. In the case of Vook's data, the value of a obtained
from the plot of ln(b) vs ln(time) was -0.25, comparable to the average value obtained
from all the fits to the Poisson distribution directly, -0.21. On the basis of this and other
results the application of the model was very encouraging.
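The internal consistency test described here reduces to a straight-line fit. A minimal sketch, assuming the slope of ln(b) against ln(time) equals $1/(2\omega - 1)$ and that $a = -2\omega$: the fitted slope is inverted to give $2\omega$ and hence a. The data in the example are synthetic, and the function names are ours.

```python
import math


def fit_slope(xs, ys):
    """Ordinary least-squares slope of ys against xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


def exponent_from_b(times, b_values):
    """Infer a = -2*omega from the time evolution of the fitted b parameter."""
    slope = fit_slope([math.log(t) for t in times],
                      [math.log(b) for b in b_values])
    two_omega = 1.0 + 1.0 / slope  # invert slope = 1 / (2*omega - 1)
    return -two_omega


if __name__ == "__main__":
    times = [1.0, 2.0, 4.0, 8.0, 16.0]
    # Synthetic b(t) generated with 2*omega = 0.25, i.e. slope = -4/3.
    b_values = [t ** (-4.0 / 3.0) for t in times]
    print(exponent_from_b(times, b_values))
```

The value returned can then be compared with the a obtained from direct fits of (2) at each annealing time, which is the comparison reported above for Vook's data.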

The  Summer  of  '94 

There  were  many  potential  goals  which  we  could  have  pursued  this  summer  that  would 
have  been  intelligent  extensions  of  our  work  with  Vook's  data.  We  decided  to  proceed  along  a  few 
different  tracks  in  order  to  hedge  against  the  possibility  that  various  potential  difficulties  would 
defeat  one  or  more  of  our  efforts. 

We  hoped  to  be  able  to  duplicate  Vook's  time  dependent  annealing  experiment  using  a  pure 
metal  film,  preferably  Al,  with  greater  resolution.  We  were  able  to  obtain  a  time  dependent 
data set on a pure aluminum film. The magnification was only 5,000x and the number of
hillocks found was in fact smaller than the number obtained by Vook. Our statistics cannot be as good
as Vook's and we will still have a poorly determined a parameter from the direct fit of the
Poisson distribution function to the experimental data, but we will be able to obtain a plot of
ln(b) vs ln(time). When completed, we will have an a value which can be compared with that
obtained by us using Vook's data. There is a real possibility that we can combine these data,
collected by Dr. Walsh, with some data collected by Jennifer Synowczynski, which would improve
our statistics. These data are still being analyzed and the results cannot be discussed further. There
is  no  question  that  we  will  ultimately  want  to  apply  this  model  to  films  of  various  metals  on  a 
variety  of  different  substrates. 

It  would  clearly  be  useful  to  obtain  data  at  much  higher  resolution  so  that  an  accurate  and 
precise  a  value  can  be  determined  directly  from  the  fit  to  the  Poisson  distribution.  We  hoped  to 
be able to obtain independent evidence that scaling arguments and scaling based models have
relevance to hillock and void evolution in thermal and electromigration stressed films. It would
also be important if we could establish any quantitative, and preferably analytic, direct
correlation between hillock evolution, to which our kinetic model pertains, and void evolution,
which is clearly of central importance in electromigration induced device failure.

We  decided  to  attempt  to  establish  the  proposition  that  voids  and  hillocks  in  metal  films 
are  self-similar.  Establishing  this  as  fact  would  establish  the  relevance  of  scaling  in  the 
analysis of hillock and void distributions. Using atomic force microscopy (AFM), it would be
possible  to  establish  self-similarity  and  to  examine  the  hillock  distributions  at  much  higher 
resolution  than  is  possible  using  SEM  on  these  types  of  samples.  It  should  be  pointed  out  that 
SEM  studies  cannot  be  performed  using  conductive  paints  or  other  approaches  to  keep  charging 
effects  from  limiting  the  resolution  since  any  approach  which  contaminates  the  surface  is  not 
compatible  with  time  dependent  studies. 

Self-Similarity 

The  question  of  exactly  what  physical  meaning,  if  any,  the  term  "self-similarity" 
denotes  is  deserving  of  some  attention  at  this  point.  Within  the  context  of  having  limited  space, 
we  begin  with  an  example  of  a  generic  mathematical  definition  paraphrased  from  Korvin’'^.  We 
consider  a  physical  structure,  perhaps  a  hillock  in  a  metal  film,  the  Nile  river,  or  the 
Andromeda  galaxy.  It  is  possible,  for  whatever  reason,  to  consider  various  ways  to  characterize 
the  object's  shape.  In  the  effort  to  do  so,  we  might  consider  a  single  object  or  set  of  objects,  that 
for  independent  reasons  we  suppose  might  have  something  to  do  with  each  other,  and  the  set's 
distinguishing  characteristics  as  compared  with  other  sets  of  objects. 

Attempting  to  be  exhaustive  in  our  considerations,  we  ask  whether  there  exists  a 
relationship  between  the  area  of  an  object,  or  a  2-dimensional  projection  of  the  object,  and  the 
perimeter  of  the  object.  The  comparison  is  based  on  the  idea  that  a  perimeter  increases  linearly 
as  larger  objects  in  a  self-similar  set  are  considered,  whereas  the  area  increases  quadratically 
as  larger  members  of  the  set  are  considered.  "Larger"  might  be  based  on  any  convenient  measure 
such  as  the  largest  distance  across  the  interior  of  the  object,  the  perimeter,  or  the  area.  Since 
we  have  been  careful  to  make  no  assumptions  regarding  the  actual  shape  of  any  objects  we  may 
consider,  the  ratio  of  these  two  quantities  might  be  sensitive  to  the  shape  of  the  objects. 
We can express the idea of considering the value of one measure of size and shape, say w,
in terms of the simultaneous value of some other measure of size and shape, say L, by
the equation below.


w=W(L) 

Now  if  this  relationship  holds  for  all  the  members  of  the  set,  then  it  may  be  possible  to  write  an 
equation  which  connects  the  members  of  the  set  in  terms  of  the  scaling  of  perimeter  or  area,  i.e. 
L. Furthermore, suppose it is possible to write (6) below, in which f is an unknown function which
depends on the shape of the objects. This form amounts to a very significant assumption because
if a set of objects can be found which satisfy (6), then (7) can also be shown to be true and the
objects form a self-similar set.

$$w_2/w_1 = W(L_2)/W(L_1) = f(L_2/L_1) \qquad (6)$$

$$f(ab) = f(a)\,f(b) \qquad (7)$$

It is then possible to apply Cauchy's treatment of functional forms to write that the function
connecting values of the two measures must be of the form

$$w = w_0 L^p \qquad (8)$$

At first, this may seem like a real leap forward in that we have a functional
form which at the very least allows us to classify sets of objects. The use of group theory to
classify various matrix elements by their behavior under
various point and space group transformations seems analogous. Empirically, self-similarity
and point group symmetry are analogous in that they allow a reduction in the number of
variables needed to specify the state of a given system, but in the case of self-similarity we
presently have very little with which to satisfy our intuition as to why the classification should
be physically meaningful.
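The step from the ratio condition to the power law can be spelled out briefly. The following is a sketch of the standard argument, assuming that the ratio of measures depends only on the ratio of lengths, $W(L_2)/W(L_1) = f(L_2/L_1)$, and that f is positive and continuous. The auxiliary function g and the identification of the prefactor with W(1) are our notation.

```latex
\begin{align*}
f\!\left(\frac{L_3}{L_1}\right)
  &= \frac{W(L_3)}{W(L_1)}
   = \frac{W(L_3)}{W(L_2)} \cdot \frac{W(L_2)}{W(L_1)}
   = f\!\left(\frac{L_3}{L_2}\right) f\!\left(\frac{L_2}{L_1}\right) \\
  &\Longrightarrow\; f(ab) = f(a)\,f(b),
   \qquad a = L_3/L_2,\quad b = L_2/L_1 \\[4pt]
g(u) \equiv \ln f(e^{u})
  &\Longrightarrow\; g(u+v) = g(u) + g(v)
   \;\Longrightarrow\; g(u) = p\,u
   \;\Longrightarrow\; f(x) = x^{p} \\[4pt]
W(L) &= W(1)\, f(L) = W(1)\, L^{p}
\end{align*}
```

The exponent p is the only shape information that survives; the prefactor W(1) simply records the value of the measure at unit length.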


First, there is nothing in the definition of self-similarity, i.e. satisfying the equations
above, which pretends to allow a general first principles calculation of the prefactor or of the exponent p.
Each case must be considered as an isolated situation. The significance and physical
interpretation of the exponent p, known in the vernacular as a variety of different kinds of
fractal dimension depending on the specific case under consideration, beyond the idea that area
or perimeter increases as an object gets larger, is usually difficult to clearly discern. The fact
that we have chosen to express these measures of size and shape in terms of each other and
examine their scaling behavior is not guaranteed to have any particular significance. To state
one overall benefit of using self-similarity we directly quote H. E. Stanley '*2.

"If  you  are  an  experimentalist,  you  try  to  measure  the  fractal  dimension  of 
things  in  nature.  If  you  are  a  theorist,  you  try  to  calculate  the  fractal 
dimension  of  models  chosen  to  describe  experimental  situations;  if  there  is  no 
agreement  then  you  try  another  model." 

In addition to having discovered an empirical measure that a correct theory of whatever
processes are being considered must correctly reproduce, if one has a theory which predicts
self-similar objects, then discovering that they exist in reality clearly supports the model. If a
model assumes self-similarity and makes predictions which turn out to be true, then this is
good but we are still not guaranteed that the model is unique. There could be another model not
based on self-similarity which also accounts for the empirical data. However, discovering that
the objects are actually self-similar clearly supports the theory which makes that assumption.
Our hillock model expresses the mass distribution in terms of a mass fractal, i.e.
self-similarity is assumed. Another important benefit is that self-similarity allows construction of
models having relatively few adjustable parameters. Therefore we attempted to find out
whether hillocks and voids under thermal annealing or electromigration conditions are
self-similar.

Again, following Korvin^^ and Stanley: "The numerical determination of fractal
dimension of an object embedded into the d-dimensional Euclidean space is generally based on
the following simple rule: Suppose we plot on log-log paper some quantity that can be
interpreted as 'number of objects' or 'mass' against a characteristic length scale L. If there is
an asymptotic ($L \to \infty$) region in which this plot becomes straight, then the slope is termed the
fractal dimension $d_q$ characterizing the distribution of the quantity q. Alternately, we can keep
the size of the object, L, fixed and determine q for different resolutions, l. If we plot q against
l, and the plot is a straight line in a range ($l < l_q \ll L$), the fractal dimension will be the
negative slope of this line. For example, if q is the area of a square of side L, it scales with L as
$L^2$; if q is the number of squares of side $l \ll L$ contained in the large square, it scales as:"

$$q = (L/l)^2 \propto l^{-2}$$


Of  course,  this  all  follows  from  (6-8)  above. 
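The fixed-size, variable-resolution rule in the quotation is essentially box counting. A minimal sketch, with hypothetical function names and a pixel set standing in for a real image: count the occupied boxes q at several box sizes l, fit the slope of ln q against ln l, and report the negative slope as the dimension.

```python
import math


def box_count(pixels, box):
    """Number of box-by-box cells containing at least one occupied pixel."""
    return len({(x // box, y // box) for x, y in pixels})


def box_dimension(pixels, boxes):
    """Negative least-squares slope of ln(count) against ln(box size)."""
    xs = [math.log(b) for b in boxes]
    ys = [math.log(box_count(pixels, b)) for b in boxes]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return -sxy / sxx


if __name__ == "__main__":
    side = 64
    filled_square = [(x, y) for x in range(side) for y in range(side)]
    straight_line = [(x, 0) for x in range(side)]
    boxes = [1, 2, 4, 8, 16]
    print(box_dimension(filled_square, boxes))  # q = (L/l)**2, so dimension 2
    print(box_dimension(straight_line, boxes))  # q = L/l, so dimension 1
```

The two test shapes reproduce the trivial Euclidean cases. A rough boundary or a thresholded AFM image would be expected to give a non-integer value over whatever range of l the log-log plot is straight.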

When  examining  surfaces,  there  is  considerable  latitude  in  defining  hillocks  and  voids. 
The  AFM  data  consists  of  a  set  of  points,  x,  y,  z  defining  the  surface  contour  of  the  sample.  We 
choose  to  view  the  surface  as  a  collection  of  hillocks  and  voids  because  we  believe  the  evolution 
of  hillocks  parallels  the  evolution  of  voids.  The  fractal  dimension  of  a  set  of  hillocks  is 
conveniently  calculated  using  a  so-called  "area-perimeter  plot".  Using  (6-8)  above  it  is  easy 
to  show  that  a  plot  of  the  logarithm  of  the  area  of  a  set  of  hillocks  as  a  function  of  the  logarithm 
of  the  perimeter  is  linear  with  slope  D  when  the  hillocks  are  a  self-similar  set.  D  is  the  fractal 
dimension  of  the  projection  of  the  hillock  on  2  dimensions.  The  fractal  dimension  of  the  surface 
composed of the hillocks or voids is commonly taken as D+1 although, according to Korvin^®,
the  rigorous  proof  of  this  has  not  been  given.  Given  AFM  data,  one  need  only  choose  some  plane  of 
reference  in  order  to  designate  all  the  surface  structures  contained  in  the  image  as  either 
hillocks  or  voids. 
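The mechanics of an area-perimeter plot can be sketched as follows. Each region is assumed to be given as a set of pixel coordinates; the area is taken as the pixel count and the perimeter as the number of pixel edges facing outside the region, which is one of several possible discretizations and is our choice. The function returns only the fitted log-log slope; how that slope is mapped onto a fractal dimension follows the convention stated in the text and is not settled by this sketch.

```python
import math


def area_and_perimeter(region):
    """Area = pixel count; perimeter = number of pixel edges facing outside."""
    cells = set(region)
    perimeter = 0
    for x, y in cells:
        for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            if (x + dx, y + dy) not in cells:
                perimeter += 1
    return len(cells), perimeter


def area_perimeter_slope(regions):
    """Least-squares slope of ln(area) against ln(perimeter) over a set of regions."""
    measured = [area_and_perimeter(r) for r in regions]
    xs = [math.log(p) for _, p in measured]
    ys = [math.log(a) for a, _ in measured]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


if __name__ == "__main__":
    # Smooth test shapes: solid squares of increasing side length.
    squares = [[(x, y) for x in range(s) for y in range(s)] for s in (2, 4, 8, 16)]
    print(area_perimeter_slope(squares))
```

For the solid squares the area grows as the square of the perimeter, so the slope is 2. Regions with convoluted boundaries gain perimeter faster than smooth ones of the same area, which lowers the slope.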

The  choice  of  reference  plane  requires  some  discussion  of  thresholding.  The  basic  fact  of 
scanning  probe  microscopy  is  that  some  type  of  small  probe  is  brought  into  close  proximity  to 
an object under study. As the probe, which is a very fine tip of one sort or another, approaches
the object to within a distance on the order of molecular dimensions, a variety of forces
between the surface and the tip become measurable. By sensing one of the processes driven by
these forces, e.g. electrical conduction, van der Waals attraction or repulsion, it is possible to
sense how close the tip is to the object. This allows the mapping of the surface contours of the
object as the tip is rastered across the region of interest at constant height. Balancing these
forces allows assignment of a z value to each x,y point on the surface. "Thresholding" refers to
construction of iso-z contours in the x,y plane. The contiguous regions of x and y that have
higher or lower z than the threshold are designated as hills or voids respectively.
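Thresholding as described amounts to labeling connected regions of a height map. The sketch below is one possible implementation, under assumptions of our own: the image is a list of rows of z values, connectivity is 4-neighbor, and pixels exactly at the threshold belong to neither class. The function name is hypothetical.

```python
from collections import deque


def count_regions(z, threshold, above=True):
    """Count 4-connected regions above (hills) or below (voids) the threshold."""
    rows, cols = len(z), len(z[0])

    def selected(r, c):
        return z[r][c] > threshold if above else z[r][c] < threshold

    seen = [[False] * cols for _ in range(rows)]
    count = 0
    for r in range(rows):
        for c in range(cols):
            if seen[r][c] or not selected(r, c):
                continue
            count += 1  # found a new region; flood fill it
            seen[r][c] = True
            queue = deque([(r, c)])
            while queue:
                cr, cc = queue.popleft()
                for nr, nc in ((cr + 1, cc), (cr - 1, cc), (cr, cc + 1), (cr, cc - 1)):
                    if (0 <= nr < rows and 0 <= nc < cols
                            and not seen[nr][nc] and selected(nr, nc)):
                        seen[nr][nc] = True
                        queue.append((nr, nc))
    return count


if __name__ == "__main__":
    z = [
        [5, 5, 1, 1, 1],
        [5, 5, 1, 0, 1],
        [1, 1, 1, 1, 1],
        [1, 4, 1, 6, 6],
    ]
    print(count_regions(z, 3, above=True))     # hills above z = 3
    print(count_regions(z, 0.5, above=False))  # voids below z = 0.5
```

Sweeping `threshold` from the minimum to the maximum z of an image and recording both counts gives the kind of hills-and-lakes curve discussed in the text.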

A  useful  analogy  is  that  of  visualizing  the  flooding  with  water  of  a  hilly  region.  The  depth 
of  the  water  determines  the  number  of  visible  hills  and  lakes.  A  high  threshold  corresponds  to 
deep  water.  Figures  1a-b  are  AFM  images  showing  that  metal  films  are  very  rough  at  all  scales. 
For the 1 μm square region (resolution of 500 x 500 lines) scans, the lateral resolution could
approach 20 Å but we suspect the tips are much wider. The z resolution is on the order of 1 Å.

The smallest features are on the order of 500 Å in the xy plane and most of the voids (lakes)
have the same shape near their bottom. Since it is arbitrary how we define the zero point level
for  the  z  direction,  we  can  create  hills  or  voids  arbitrarily  by  our  choice  of  z=0  elevation.  Some 
hills  are  not  of  higher  z  than  the  water  level.  Analyzed  from  this  point  of  view,  the  question  of 
thresholding  becomes  a  percolation  problem  involving  the  formation  of  a  single  contiguous 
flooded region or land region. The number of voids and hills is a strong function of threshold
although  it  is  thought  that  the  fractal  dimension  calculated  for  a  particular  choice  is  much  less 
sensitive  if  chosen  far  from  the  percolation  threshold. 

Figure  2  shows  a  graph  of  the  number  of  hills  and  lakes  found  as  a  function  of  z  threshold 
for a particular AFM image which is typical of those studied. The film was 7000 Å of evaporated
aluminum on silicon. Both substrate and film were passivated so we suspect that our images are
obtained through a conformal oxide layer on the order of 20-30 Å thick. The thresholding is
normalized such that the limits of the abscissa are 0-100% of the maximum z found for that
image. Figures 3a-c show graphs of the surface fractal dimension calculated for a range of
thresholds  for  a  few  different  films.  Figure  4  shows  a  series  of  typical  area-perimeter  plots 
based  on  either  lakes  or  hills  obtained  from  another  image.  These  results  reveal  much  about  the 
structure  of  these  films. 

Figure 2 suggests that a cut into a film, along a plane perpendicular to its surface, would
reveal a profile like that shown in Figure 5a in which hillocks are taken as spherical. The point
here is whether the bases of different sets of adjacent hillocks make contact at different z
values or whether a picture more like Figure 5b is appropriate in which there are very flat
regions separating isolated hemispherical hillocks. In the situation depicted in Figure 5b, the
number of hills would go from zero to the maximum found within the smallest z increment
instrumentally possible and then decline monotonically for increasing z thereafter. When there
are hillocks on top of hillocks, increasing certain z thresholds will cover more hills than they will separate.
This results in the number of hills or voids found not immediately being a maximum value at the
lowest z value for which hills are found.
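The difference between the two pictures can be illustrated with one-dimensional height profiles. The profiles below are invented for illustration and are not taken from Figures 5a-b: one has isolated bumps on a flat base, the other has small bumps riding on a larger hill. A hill is counted as a maximal run of consecutive samples strictly above the threshold.

```python
def count_hills(profile, threshold):
    """Count maximal runs of consecutive samples strictly above the threshold."""
    hills = 0
    inside = False
    for z in profile:
        if z > threshold and not inside:
            hills += 1  # entering a new run above the threshold
        inside = z > threshold
    return hills


def hill_counts(profile, thresholds):
    """Number of hills found at each threshold, in the order given."""
    return [count_hills(profile, t) for t in thresholds]


if __name__ == "__main__":
    isolated = [0, 0, 3, 0, 0, 2, 0, 0, 1, 0]  # flat base with separate bumps
    stacked = [0, 2, 3, 2, 4, 2, 3, 2, 0]      # bumps on top of a larger hill
    print(hill_counts(isolated, [0, 1, 2, 3]))
    print(hill_counts(stacked, [0, 1, 2, 3, 4]))
```

For the isolated bumps the count is largest at the lowest threshold and then only falls. For the stacked profile the count first rises, as the threshold separates the bumps that share a base, and only then falls, so the maximum does not occur at the lowest z for which hills are found.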

In the midwestern United States, a favorite country witticism is: "For every hill there's a
hole."  The  land  is  so  flat  that  the  statement  is  very  often  true  in  actual  fact.  For  every  film 
studied  there  was  always  a  choice  of  z  threshold  possible  for  which  the  number  of  hills  was 
equal  to  the  number  of  voids.  This  was  always  very  close  to  the  percolation  threshold.  It  is  clear 
that  there  is  not  perfect  symmetry  between  the  number  of  hills  or  voids  as  a  function  of 
threshold.  It  is  also  clear  that  the  fractal  dimension  is  a  function  of  z  threshold  and  that  it 
sometimes seems as if there is a systematic variation occurring. This deserves more study as
does  a  more  quantitative  study  of  the  variation  of  the  number  of  hills  and  voids  with  choice  of  z 
threshold. 

The variability may be unfortunate if we eventually want to establish a reproducible
protocol for thresholding that will allow calculation of a fractal dimension that can be compared
from sample to sample, as a function of electromigration or thermal annealing or perhaps after
various other types of film processing. While unfortunate, it still must be admitted that the
entire range of surface fractal dimensions found for the AFM images of aluminum was roughly
2.57 ± .11 regardless of whether voids or hillocks were used in the area-perimeter plots. If a
systematic part of the variability exists and can be quantitatively analyzed, then we suspect that
the uncertainty on the fractal dimension measurement could be reduced by an order of magnitude
or more. This would greatly increase the potential utility of the observable fractal dimension for
various applications. Although we were only able to obtain a few usable images, it should be
noted that for laser deposited platinum films on quartz, the fractal dimension found from
area-perimeter plots was 2.55 ± .09.

Graphs illustrating the other type of analysis mentioned by Stanley, which involves changing
the size of the region observed and counting the number of objects, are shown in Figure 6. The
slopes of these graphs for the aluminum films give a surface fractal dimension of ≈2.55, while
for the platinum film we obtain 2.9. We cannot say why the aluminum films give better agreement
with the value obtained from area-perimeter plots than the platinum film does.
The fact that the fractal dimension obtained using hills is not
distinguishable from the value obtained from voids supports our hope that we may be able to
connect the behavior of hills predicted using our model with that of the voids.

These fractal dimension measurements can be used as a check of the quantitative
consistency of our coalescence growth model. To utilize (4), we assign α = -.5 and d_w = 2.

This follows from assuming that hillock growth occurs by diffusive motion of very small
clusters and that the velocity of the mass transfer scales as the inverse square root of the mass
of the cluster, consistent with the usual simple kinetic theory. The fractal dimension of the
surface of the films, which may be where most of the material involved in coalescence
growth can be found just before coalescence events occur, is d = 2.55. This number therefore
corresponds to the dimension of the space in which the coalescence growth is occurring. Finally,
the dimensionality of the hillocks themselves is also obtained by adding unity to the slope of the
area-perimeter plots, giving D = 2.57. Using (4) we predict ω = -0.29, which
compares quite favorably with ω = -0.24 from Vook's ln(b) vs ln(t) plots.
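
The slope-plus-unity rule used above can be illustrated with a short sketch. It assumes, as the axis label of Figure 7 suggests, that ln(area) is regressed on ln(perimeter); the function names and the synthetic data are hypothetical and are not part of the original analysis.

```python
import math

def least_squares_slope(xs, ys):
    """Ordinary least-squares slope of ys regressed on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx

def surface_fractal_dimension(perimeters, areas):
    """Slope of the ln(area) vs ln(perimeter) plot, plus unity.

    Assumption: this axis convention is inferred from the Figure 7 axis
    label, not stated explicitly in the text.
    """
    log_p = [math.log(p) for p in perimeters]
    log_a = [math.log(a) for a in areas]
    return 1.0 + least_squares_slope(log_p, log_a)

# Synthetic objects whose areas scale exactly as perimeter**1.6.
example_perimeters = [1.0, 2.0, 4.0, 8.0, 16.0]
example_areas = [p ** 1.6 for p in example_perimeters]
example_dimension = surface_fractal_dimension(example_perimeters, example_areas)
```

With real image data the points scatter about the line, and objects near the resolution limit would normally be excluded before fitting.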

We obtained perimeter-area plots using SEM data for comparison with the same kinds of
plots based on AFM data. Vook's data was not available for this purpose since he only reported
diameters as his raw data. Instead, Professor Kime allowed us to analyze the data
she had obtained last summer. She obtained area-perimeter data on over 1000 voids of similar
passivated aluminum films. The plot obtained from this data is in Figure 7 and, remarkably,
over 900 voids form a reasonable line giving a surface fractal dimension of 2.71, in good
agreement with the values obtained from AFM data. Many of the voids which deviated
significantly from the line shown were in fact at the lower limit of the resolution of the
instrument. This is a common pitfall that causes the measured areas to depend on the instrumental
resolution, and that data would have had to be discarded on that basis regardless of any other
considerations.

Finally, we utilized the remarkable data of Thomas and Calabrese involving the
evolution of a single void under electromigration conditions. A single void was observed under
nominally constant conditions for a period of hours, with images being obtained every few
minutes or so. This data was converted to a standard area-perimeter data set as shown in Figure
8, except that all the areas and perimeters correspond to the exact same void. The plot yields a
reasonably straight line giving a surface fractal dimension of 2.67. It appears that the long time
average behavior of a single microscopic void can be replaced by a spatial average of a large
number of voids at a single time. We believe this to be a singular result in science. One of the
basic precepts of statistical mechanics is explicitly confirmed by direct observation.

This final result is profound. It again shows that the fractal dimensions of voids and hills
are nearly equal, that AFM and SEM data are comparable, and that the fractal dimension of the voids,
and therefore probably of the hillocks, remains constant throughout their evolution. Despite the fact
that the void changes its exact shape repeatedly in their data, the fractal dimension could be used
in (4) to follow the growth of a distribution of such hillocks (voids).

Summary  and  Conclusions 

At the precision available to us, voids and hillocks are self similar across spatial scales
spanning greater than three orders of magnitude and they appear to have the same fractal
dimensionality. Fractal exponents obtained from either AFM or SEM images can be inserted into
the equation governing the scaling parameter ω from our kinetic model of hillock evolution.
Direct observation shows that the fractal dimensionality of an electromigration driven void does
not vary with time during its growth. The fractal exponent for passivated aluminum
metallization patterns is 2.57 ± .22 (2σ).

Figure 2: Number of hillocks (or voids) found as a function of z-threshold.


Figure 3a-b: Fractal dimension found as a function of z-threshold for 2 typical films. Some
systematic variation seems apparent.

Figure 4: Typical area-perimeter plot of hillocks obtained from an AFM image.

Figure 5: Surface profile for close packed hillocks (a) and non-overlapping hillocks (b).

Figure 6: Number of hillocks (lakes) found for a particular z-threshold as a function of size of
the region (L) examined (horizontal axis: ln(L)). 500x500 and 400x400 point resolution data are
used. Ordering of the fits matches the ordering of the legend.

Figure 7: Area perimeter plot obtained from Professor Kime's data (see text). Horizontal axis:
perimeter (μ). Fitted line: y = -3.4686 + 1.7653x, R² = 0.988.

Figure 8: Area perimeter plot obtained from Thomas and Calabrese data (see text).


ON  TESTING  THE  EQUALITY  OF  COVARIANCE  MATRICES  UNDER  SINGULARITY 


Pinyuen  Chen 
Professor 

Department  of  Mathematics 


Syracuse  University 
215  Carnegie 
Syracuse,  NY  13224-1150 


Final  Report  for: 

Summer  Faculty  Research  Program 
Rome  Laboratory 


Sponsored  by: 

Air  Force  Office  of  Scientific  Research 
Bolling Air Force Base, DC
and 

Rome  Laboratory 


August  1994 





Abstract 


This report considers the problem of statistically testing the equality of two covariance matrices from
complex multivariate normal distributions. The underlying distribution is assumed to be singular in the
sense that one of the two sample covariance matrices has rank one and the other is of full rank. A test
procedure is proposed and its distributions under the null and alternative hypotheses are derived. Two
interesting properties of the test procedure, the invariance and monotonicity properties of the power
function, are studied for a general setting and then applied to the special case studied in this project. The
test procedure is also shown to be equivalent to several other well-known test procedures which were
previously studied only for the regular case, that is, when both sample covariance matrices are of full
rank. Some suggestions for further research are given in the last two sections of this report.


1. Introduction

A signal detection problem is usually described in terms of a model $X = s + n$ where $s$ is a fixed
signal, $n \sim CN_p(0, R_1)$ is a noise variable, and $X$ is the received message. A signal is said to be
detected if the received message $X$ contains the signal $s$. In statistical terms, a signal is claimed to be
present if the null hypothesis that $s = 0$ is rejected versus the alternative hypothesis that $s = a$
(specified). The above testing procedure is based on two independent random variables $Y$ and $S_1$, where
$Y \sim CN_p(s, R_1)$ and $S_1 \sim CW_p(k, R_1)$. In this report, we study a hypothesis testing problem on the
equality of the covariance matrix of $Y$ (denoted by $R_2$) and the expectation of $S_1$ (denoted by $R_1$).
Instead of testing whether the mean vector of $Y$ is the null vector against the alternative that it is a given
non-null vector, we are interested in testing whether the distribution of the statistic $Y$ comes from the
same family that induces the statistic $S_1$. This is a meaningful step taken prior to the testing problem on
$s$ mentioned above, because almost all the inferences in signal detection are based on the assumption that
the covariance matrix of $Y$ is the same as the expectation of $S_1$. An earlier inference on the equality of
$R_1$ and $R_2$ may be helpful in identifying an outlier or a contaminated sample in the background
environment. A detection of a violation in the model (that is, an acceptance of $H_1: R_1 \ne R_2$) will
certainly prompt investigation of the robustness of the original procedures in testing $s = 0$. The violation
in the model will also guide the experimenters to use a different procedure. The procedure may be less
powerful than the ones that were derived under the assumption of $R_1 = R_2$. But it certainly will give
better detection probability when the assumption of $R_1 = R_2$ is not true.

The classical likelihood ratio criterion for testing the equality of two covariance matrices is not
appropriate here, as we can argue as follows. It is clear from the literature (see formula (8) on page 406,
Anderson [1]) that the likelihood criterion can be written, up to a constant factor, as

$\lambda \propto \dfrac{|S_1|^{n_1/2}\,|S_2|^{n_2/2}}{|n_1 S_1 + n_2 S_2|^{(n_1+n_2)/2}}$

where $S_1$ and $S_2$ are the respective sample covariance matrices, and $n_1$ and $n_2$ are the respective
degrees of freedom. In our application to signal detection, $S_2$ has rank 1 and thus is singular when
$p > 1$, so that $|S_2| = 0$. This implies that the above test statistic $\lambda$ is identically 0. So the
likelihood ratio approach is not valid here.

In Section 2, we introduce the notation, describe the problem in a formal manner, make the necessary
assumptions on the samples and populations, and introduce the proposed test statistic and acceptance
criterion. We also give the distributions of the proposed test statistic under both null and non-null
hypotheses. All the results are given in both the real and the complex cases. Most of those distributions
are derived theoretically. Only one of them is argued heuristically. In Section 3, we study two important
properties of the proposed test statistic, namely the invariance and the monotonicity property of the power
function. Concluding remarks and acknowledgments are made in Section 4.

2.  The  problem  and  the  proposed  test  procedure 

We assume that $X_1, X_2, \ldots, X_k$ are $k$ independent random vectors from a $p$-variate complex normal
distribution $CN_p(0, R_1)$. The random vector $X_{k+1}$ is distributed, independently of $X_1, X_2, \ldots, X_k$,
according to a $p$-variate complex normal distribution $CN_p(0, R_2)$. Here both $R_1$ and $R_2$ are unknown. Our
goal is to perform a statistical hypothesis test of

(2.1)  $H_0: R_1 = R_2$ versus $H_1: R_1 \ne R_2$.

Our proposed test statistic is

(2.2)  $T = X_{k+1}^H S_1^{-1} X_{k+1}$, where $S_1$ is the sample variance-covariance matrix of $X_1, X_2, \ldots, X_k$.

Now we consider the distributions of $T$ for the following cases:

Case 1: When the sample size $k$ is large, the application of the law of large numbers to each element in
the covariance matrix implies that $S_1$ is a consistent estimate of $R_1$. So $S_1$ converges to $R_1$ in
probability. In this case, we can use $S_1$ to estimate $R_1$ and thus consider $R_1$ as known.

Under $H_0: R_1 = R_2$: $T \sim$ Chi-square with $2p$ degrees of freedom.

Proof: We refer now to Theorem 3.3.3 in Anderson [1], which considers the
distribution of $T$ for the real elements case. To extend his result in the real elements case to
our complex elements case, we change all the $Y'$, $\nu'$, $C'$, $Z'$, and $\lambda'$ to
$Y^H$, $\nu^H$, $C^H$, $Z^H$, and $\lambda^H$ respectively. Then we will get $Y^H T^{-1} Y = Z^H Z$, which can be
considered as the sum of squares of the components of $Z$. Hence
$Y^H T^{-1} Y \sim$ Chi-square with $2p$ degrees of freedom with non-centrality
parameter $\nu^H T^{-1} \nu$. Thus we have $T \sim$ Chi-square with $2p$ degrees of freedom.

Under $H_1: R_1 \ne R_2$:

Let $Z = R_1^{-1/2} X_{k+1}$. Then $ZZ^H \sim CW_p(1, R_1^{-1/2} R_2 R_1^{-1/2})$, which does not have a regular
Wishart density if $p > 1$. However, $T$ is a quadratic form in normal variates whose distribution has
been approximated by many authors. Here we suggest the following approach.

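We wish to find the distribution of $T$, that is $P(T < a)$. We first show that the distribution
of $T$ is the same as that of $W^H \Lambda W = \sum_{i=1}^{p} \lambda_i (W_{1i}^2 + W_{2i}^2)$, where
$\lambda_1 \ge \lambda_2 \ge \cdots \ge \lambda_p$ are the eigenvalues of $R_2 R_1^{-1}$, $\Lambda$ is the diagonal
matrix of $\lambda_1, \lambda_2, \ldots, \lambda_p$, and $W_{11}, \ldots, W_{1p}, W_{21}, \ldots, W_{2p}$ are i.i.d.
standard normal random variables. Any quadratic form in $X_1, X_2, \ldots, X_p$ may be written as
$Q(X) = X^H A X$ where $A$ is a $p$ by $p$ symmetric matrix. Let $q = R_2^{-1/2} y$; then
$q^H q = y^H R_2^{-1} y$, and since $|R_2| = |R_2^{1/2}|\,|R_2^{1/2}|$, we obtain

(2.3)  $P(T < a) = P(Q(X) < a) = \pi^{-p} \int_{q^H (R_2^{1/2} R_1^{-1} R_2^{1/2}) q < a} \exp(-q^H q)\, dq$.

If $\Lambda$ is the diagonal matrix of eigenvalues of $R_2^{1/2} R_1^{-1} R_2^{1/2}$ and $U$ is the associated
orthogonal matrix of eigenvectors, then $R_2^{1/2} R_1^{-1} R_2^{1/2} = U \Lambda U^H$. Making
another transformation $w = U^H q$, we obtain

(2.4)  $P(T < a) = P(Q(X) < a) = \pi^{-p} \int_{w^H \Lambda w < a} \exp(-w^H w)\, dw$.

Thus the distribution of $T$ is the same as that of $W^H \Lambda W = \sum_{i=1}^{p} \lambda_i (W_{1i}^2 + W_{2i}^2)$,
where $W_{11}, W_{12}, \ldots, W_{1p}, W_{21}, W_{22}, \ldots, W_{2p}$ are i.i.d. standard normal random variables.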

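Now we consider Ruben's [2] (Theorem 1, page 550) approximation to the
probability $P(T < a)$.

For $\lambda_1 = a_1 = a_2$, $\lambda_2 = a_3 = a_4$, ..., $\lambda_p = a_{2p-1} = a_{2p}$ we have

(2.5)  $P(T < a) = P\left(\sum_{i=1}^{p} \lambda_i (W_{1i}^2 + W_{2i}^2) < a\right) = P\left(\sum_{j=1}^{2p} a_j Z_j^2 < a\right) = \sum_{l=0}^{\infty} c_l F_{2p+2l}(a/\beta)$

where $F_{2p+2l}$ is the distribution of a central Chi-square random variable with $(2p+2l)$
degrees of freedom, $\beta$ is an arbitrary constant, and

(2.6)  $c_l = (2l)^{-1} \sum_{r=0}^{l-1} g_{l-r}\, c_r \;\; (l > 0), \qquad c_0 = \prod_{j=1}^{2p} (\beta / a_j)^{1/2}$

where $g_m = \sum_{j=1}^{2p} (1 - \beta/a_j)^m$, $m = 1, 2, \ldots$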
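
The recursion in (2.5) and (2.6) is straightforward to evaluate numerically. The sketch below is illustrative only: it takes the representation of $T$ as $\sum \lambda_i (W_{1i}^2 + W_{2i}^2)$ with standard normal $W$'s at face value, chooses $\beta$ as the smallest coefficient (an assumption made here so that the series converges), and truncates the series after a fixed number of terms. The function names are hypothetical.

```python
import math

def chi2_cdf_even(x, df):
    """CDF of a central chi-square with an even number of degrees of freedom."""
    if x <= 0.0:
        return 0.0
    half = x / 2.0
    term = 1.0
    total = 1.0
    for i in range(1, df // 2):
        term *= half / i          # (x/2)^i / i!
        total += term
    return 1.0 - math.exp(-half) * total

def ruben_cdf(a, lambdas, beta=None, terms=200):
    """Truncated series (2.5) for P(sum_i lambda_i (W1i^2 + W2i^2) < a)."""
    coeffs = [lam for lam in lambdas for _ in range(2)]   # each lambda_i appears twice
    n = len(coeffs)                                       # n = 2p
    if beta is None:
        beta = min(coeffs)                                # assumed choice of beta
    c = [math.prod(math.sqrt(beta / a_j) for a_j in coeffs)]       # c_0 in (2.6)
    g = [0.0] + [sum((1.0 - beta / a_j) ** m for a_j in coeffs)
                 for m in range(1, terms)]                         # g_m in (2.6)
    for l in range(1, terms):
        c.append(sum(g[l - r] * c[r] for r in range(l)) / (2.0 * l))
    return sum(c[l] * chi2_cdf_even(a / beta, n + 2 * l) for l in range(terms))
```

For $p = 2$ with $\lambda = (1, 2)$ the sum of two scaled exponentials has the closed-form distribution $1 + e^{-a/2} - 2e^{-a/4}$, which gives a convenient check on the truncated series.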

Another possible approach to the distribution of $T$ under $H_1: R_1 \ne R_2$ is to use
simulation on $\sum_{i=1}^{p} \lambda_i (W_{1i}^2 + W_{2i}^2)$ and calculate the power at different values of
the $\lambda$'s.
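
A minimal sketch of such a simulation follows. Two choices here are assumptions made for illustration and are not taken from the report: the test rejects for large values of $T$, and the critical value is taken as an empirical quantile of the simulated null distribution (all $\lambda_i = 1$). Function names, the default level, and the replication count are hypothetical.

```python
import random

def simulate_t(lambdas, n_rep, rng):
    """Draws of sum_i lambda_i (W1i^2 + W2i^2) with i.i.d. standard normal W's."""
    return [sum(lam * (rng.gauss(0.0, 1.0) ** 2 + rng.gauss(0.0, 1.0) ** 2)
                for lam in lambdas)
            for _ in range(n_rep)]

def estimate_power(lambdas, level=0.05, n_rep=20000, seed=0):
    """Monte Carlo rejection rate at the given lambdas for a one-sided test."""
    rng = random.Random(seed)
    p = len(lambdas)
    null_draws = sorted(simulate_t([1.0] * p, n_rep, rng))
    critical_value = null_draws[int((1.0 - level) * n_rep)]   # empirical null quantile
    alt_draws = simulate_t(lambdas, n_rep, rng)
    return sum(t > critical_value for t in alt_draws) / n_rep
```

Calling `estimate_power([1.0, 1.0, 1.0])` should return a value near the nominal level, while larger eigenvalues such as `[3.0, 3.0, 3.0]` should give a visibly higher rejection rate.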

Case  2:  For  the  case  where  k  is  any  integer  such  that  k  >  p: 

Under the null hypothesis $H_0: R_1 = R_2$:

First  we  state  a  theorem  which  deals  with  the  distribution  of  T  in  the  real  case. 

Theorem 2.1 (Anderson [1], page 163): Let $T = Y' S^{-1} Y$, where $Y$ is distributed according to $N(\nu, \Sigma)$
and $nS$ is independently distributed as $\sum_{\alpha=1}^{n} Z_\alpha Z_\alpha'$ with $Z_1, Z_2, \ldots, Z_n$
independent, each with distribution $N(0, \Sigma)$. Then $(T/n)[(n-p+1)/p]$ is distributed as a noncentral F with
$p$ and $n-p+1$ degrees of freedom and noncentrality parameter $\nu' \Sigma^{-1} \nu$. If $\nu = 0$, the
distribution is central F.

To prove an analogous result for the complex case, we can proceed in a similar manner as in the
proof of the theorem (see Anderson [1], page 163). We need only change the transpose $Z'$ to the
conjugate transpose $Z^H$, $N(\nu, \Sigma)$ to $CN(\nu, \Sigma)$, and orthogonal matrices to unitary matrices,
and the following result will follow.

Corollary 2.2: Let $T = Y^H S^{-1} Y$, where $Y$ is distributed according to $CN(\nu, \Sigma)$ and $nS$ is
independently distributed as $\sum_{\alpha=1}^{n} Z_\alpha Z_\alpha^H$ with $Z_1, Z_2, \ldots, Z_n$ independent,
each with distribution $CN(0, \Sigma)$. Then $(T/n)[(n-p+1)/p]$ is distributed as a noncentral F with $2p$ and
$2(n-p+1)$ degrees of freedom and noncentrality parameter $\nu^H \Sigma^{-1} \nu$. If $\nu = 0$, the
distribution is central F.

Thus in our notation,

$T \sim \left(\dfrac{kp}{k-p+1}\right) F_{2p,\,2(k-p+1)}$.
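
The null distribution above can be checked by simulation. The sketch below is restricted to $p = 2$ with identity covariance so that the $2 \times 2$ Hermitian inverse can be written out by hand; it compares the Monte Carlo mean of the scaled statistic with the mean $d_2/(d_2 - 2)$ of an $F$ variable with $d_2 = 2(k-p+1)$ denominator degrees of freedom. All names and the choices $k = 10$ and 20000 replications are assumptions made for illustration.

```python
import random

def complex_normal(rng):
    """One CN(0, 1) draw: independent real and imaginary parts, variance 1/2 each."""
    s = 0.5 ** 0.5
    return complex(rng.gauss(0.0, s), rng.gauss(0.0, s))

def t_statistic(k, rng):
    """T = Y^H S^{-1} Y for p = 2, with S = (1/k) * sum of Z Z^H over k draws."""
    a = d = 0.0
    b = 0j
    for _ in range(k):
        z1, z2 = complex_normal(rng), complex_normal(rng)
        a += abs(z1) ** 2
        d += abs(z2) ** 2
        b += z1 * z2.conjugate()
    a, d, b = a / k, d / k, b / k              # S = [[a, b], [conj(b), d]]
    det = a * d - abs(b) ** 2
    y1, y2 = complex_normal(rng), complex_normal(rng)
    quad = d * abs(y1) ** 2 + a * abs(y2) ** 2 - 2.0 * (b * y1.conjugate() * y2).real
    return quad / det

def scaled_mean(k=10, n_rep=20000, seed=1):
    """Monte Carlo mean of T (k - p + 1) / (k p) for p = 2."""
    p = 2
    rng = random.Random(seed)
    scale = (k - p + 1) / (k * p)
    return sum(scale * t_statistic(k, rng) for _ in range(n_rep)) / n_rep
```

For $k = 10$ and $p = 2$ the reference value is $18/16 = 1.125$. Agreement of the mean is a necessary check only; a fuller comparison would test the whole distribution.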


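Under the alternative hypothesis $H_1$:

Now $T = X_{k+1}^H S_1^{-1} X_{k+1} = \mathrm{tr}(X_{k+1}^H S_1^{-1} X_{k+1}) = \mathrm{tr}(X_{k+1} X_{k+1}^H S_1^{-1}) = \mathrm{tr}(S_2 S_1^{-1})$, where $S_2 = X_{k+1} X_{k+1}^H$.

First we consider the case that all the variables are real normal random variables. Since $S_2$ has rank
1 and $S_1$ has full rank $p$, the rank of $S_2 S_1^{-1}$ is 1. Thus

(2.7)  $T = \mathrm{tr}(S_2 S_1^{-1})$ = the largest eigenvalue of $S_2 S_1^{-1}$.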
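
The step from the trace to the largest eigenvalue in (2.7) can be made explicit. The following short verification is added for illustration, writing $x$ for $X_{k+1}$ and assuming $S_1$ is positive definite:

```latex
\begin{align*}
(S_2 S_1^{-1})\,x &= x\,x^{H} S_1^{-1} x = \left(x^{H} S_1^{-1} x\right) x = T\,x ,
\end{align*}
% so x is an eigenvector of S_2 S_1^{-1} with eigenvalue T >= 0.
% Because S_2 S_1^{-1} has rank one, its remaining p - 1 eigenvalues are zero, hence
\begin{align*}
\operatorname{tr}\!\left(S_2 S_1^{-1}\right) &= T + 0 + \cdots + 0
  = \text{largest eigenvalue of } S_2 S_1^{-1}.
\end{align*}
```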


For the non-singular case where both sample matrices have full rank $p$, with orders $n_1$ and $n_2$
respectively, formula (16) of Khatri [3] gives the density of $f$, the largest eigenvalue of $S_2 S_1^{-1}$
(we are considering the real case now), as follows:
(2.8) 


3^2(^»,+^n;,^p  +  l,^p-^-,^p,^{n,+p  +  iy,f^{A+f^I)-‘) 


where 


^  =  r(^)r/^«,+^«,)rp_,(^/,+;){r(-J 


/')r(  j«;)r,(^«,)r,_,(|/i,  +  j)r^ 


$\Gamma_p(t) = \pi^{p(p-1)/4} \prod_{j=1}^{p} \Gamma\left(t - \tfrac{1}{2}j + \tfrac{1}{2}\right)$, $\Lambda = \mathrm{diag}(\lambda_1, \lambda_2, \ldots, \lambda_p)$,

here the $\lambda$'s are the eigenvalues of $R_2 R_1^{-1}$, and ${}_3F_2$ is the hypergeometric function in matrix
argument as defined in James [4]. Since the derivation of the distribution of $T$ for our case involves the
simplification of ${}_3F_2$, we need to write down the complete definition of the hypergeometric function in
matrix argument. We consider the partition $\kappa = (k_1, k_2, \ldots, k_m)$, $k_1 \ge k_2 \ge \cdots \ge k_m \ge 0$,
$k_1 + k_2 + \cdots + k_m = k$, of $k$ into not more than $m$ (the number of variables in each vector) parts.
Then from page 477 of James [4], we have


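Definition 2.1

(2.9)  ${}_pF_q(a_1, \ldots, a_p;\, b_1, \ldots, b_q;\, S) = \sum_{k=0}^{\infty} \sum_{\kappa} \dfrac{(a_1)_\kappa \cdots (a_p)_\kappa}{(b_1)_\kappa \cdots (b_q)_\kappa} \dfrac{C_\kappa(S)}{k!}$

where $a_1, \ldots, a_p, b_1, \ldots, b_q$ are real or complex constants and the multivariate hypergeometric
coefficient is

(2.10)  $(a)_\kappa = \prod_{i=1}^{m} \left(a - \tfrac{1}{2}(i-1)\right)_{k_i}$

where $(a)_k = a(a+1)(a+2)\cdots(a+k-1)$.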

It is clear from the arguments made in Anderson [1], Section 13.2.4 that, in order to
obtain the density function of the roots of $|A - f(A+B)| = 0$ for the singular case (that is,
when the rank of $A$ is less than $p$), we only need to make the changes in the density
function of the roots of $|A - f(A+B)| = 0$ by replacing $p$ by the new $n_1$, $n_1$ by the new $p$,
and $n_2 - p - 1$ by the new $n_2 - p - 1$ (that is, replace $n_2$ by the new $n_1 + n_2 - p$). Thus in
formula (2.8), by changing $(n_1, n_2, p)$ to $(p, n_1, n_1 + n_2 - p)$, we obtain


--P 


(2.11) 


where 
d  = 


<^iAi  ir  v+ZpA'^i  ^ 


r(4)r„,  (4«3  +n,- p)r„^_,{jr,,  +^P  +  ^)r‘ 


r(  j)r„,  ( +jn,){r(jn,)r(jp)r„_  (j(n,+n, +\p+\)r‘ 

and $\Lambda$ is the diagonal matrix with $n_1$ elements $\lambda_1 \ge \cdots \ge \lambda_{n_1}$.

Now we replace $n_1$ by 1 in the ${}_3F_2$ function in (2.10). The third argument in the first 3-vector
becomes 0.

For any $k > 0$, since $k_1 > 0$, we have

(2.12)  $(a_3)_\kappa = (0)_\kappa = (0)_{k_1} \left(-\tfrac{1}{2}\right)_{k_2} \cdots = 0$.

For $k = 0$, by Formula (17) in James [4], we get

(2.13)  $(a)_\kappa = 1$ for all $a$.
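
The vanishing of the coefficient in (2.12) and its value in (2.13) follow directly from the rising-factorial definition, as the following sketch illustrates. The function names are hypothetical; the partition is passed as a tuple $(k_1, k_2, \ldots)$.

```python
def rising_factorial(a, k):
    """(a)_k = a (a + 1) ... (a + k - 1), with (a)_0 = 1."""
    result = 1.0
    for i in range(k):
        result *= a + i
    return result

def hypergeometric_coefficient(a, kappa):
    """(a)_kappa = product over i of (a - (i - 1)/2)_{k_i} for kappa = (k_1, k_2, ...)."""
    result = 1.0
    for i, k_i in enumerate(kappa, start=1):
        result *= rising_factorial(a - 0.5 * (i - 1), k_i)
    return result
```

With `a = 0.0` any partition having $k_1 > 0$ gives zero because the first factor of $(0)_{k_1}$ is 0, while the empty partition of $k = 0$ gives 1 for every `a`.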

Thus the ${}_3F_2$ function in (2.11) above becomes

(2.14)  ${}_3F_2 = \sum_{\kappa} C_\kappa\left(f(\Lambda + fI)^{-1}\right)$

where $\kappa$ is the partition for $k = 0$. Moreover by (17) in James [4],

(2.15)  $(\mathrm{tr}\, S)^k = \sum_{\kappa} C_\kappa(S)$.

Thus the ${}_3F_2$ function in (2.10) is equal to 1. Now by letting $n_1 = 1$ in the density
function (2.11), we finally obtain the density of $T = f$:


(2.16) 


/(/J  = 


rt—  — 

r(-)r(-) 


Thus $T$ has a Pearson Type VI distribution with $p$ and $n_2 - p + 1$ degrees of freedom.

That  is 
(2.17) 


~ — ~ — (~)  ^  F  distribution  with  p  and  n2-p  degrees  of  freedom. 
P 


Now we consider the complex case by a heuristic argument. The analogous theory for the
density function of the largest eigenvalue of the matrix $S_2 S_1^{-1}$ in the complex case does not exist
as it does for the largest eigenvalue of the matrix $S_2 S_1^{-1}$ in the above real case. However, based on
the facts that (1) both $S_1$ and $S_2$ are positive Hermitian, so their eigenvalues are positive real, and so
are the eigenvalues of $S_2 S_1^{-1}$, (2) the F distribution can usually be considered as the ratio of two
Chi-square random variables, and (3) most of the Chi-square random variables involved in the complex
case have twice as many degrees of freedom as in the analogous real case, we conjecture that the
density function of the complex case has the following property:


(2.18) 


n  2-p  +  l  J, 


(~)  F  distribution  with  2p  and  2(n2  -  p  +1)  degrees  of 


P  '^2 
freedom. 

Here we suggest two methods to verify the above conjecture on the distribution of $T$.

Method 1: Simulation method.

First generate from a normal variates generator $k$ independent complex normal samples
$X_1, X_2, \ldots, X_k \sim CN_p(0, R_1)$ and an independent complex normal sample $X_{k+1} \sim CN_p(0, R_2)$,
where both $R_1$ and $R_2$ are given. Calculate the statistic $T = X_{k+1}^H S_1^{-1} X_{k+1}$, where $S_1$ is the
sample variance-covariance matrix of $\{X_1, X_2, \ldots, X_k\}$. Use the Kolmogorov test (see, for example,
Lehmann [6]) to test the null hypothesis that the scaled statistic in (2.18) follows the F distribution with
$2p$ and $2(n_2 - p + 1)$ degrees of freedom, where $\lambda_1$ is the largest eigenvalue of the known matrix
$R_2 R_1^{-1}$.
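
The Kolmogorov statistic needed in Method 1 is simple to compute once the hypothesized distribution function is available. The sketch below is generic: `cdf` is any callable returning the hypothesized distribution function (for Method 1 it would be the F distribution function, which is not implemented here), and the uniform example is purely illustrative. The function names are hypothetical.

```python
def kolmogorov_statistic(sample, cdf):
    """D_n = sup_x |F_n(x) - F(x)| for the empirical distribution F_n of the sample."""
    ordered = sorted(sample)
    n = len(ordered)
    d = 0.0
    for i, x in enumerate(ordered, start=1):
        f = cdf(x)
        # The empirical CDF jumps from (i - 1)/n to i/n at the i-th order statistic.
        d = max(d, i / n - f, f - (i - 1) / n)
    return d

def uniform_cdf(x):
    """Distribution function of the uniform distribution on [0, 1]."""
    return min(max(x, 0.0), 1.0)

example_d = kolmogorov_statistic([0.7, 0.1, 0.4], uniform_cdf)
```

The hypothesis would be rejected when $D_n$ exceeds the critical value of the Kolmogorov distribution for the chosen level and sample size.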


Method 2: Theoretical proof

Goodman [5] has studied distributions derived from a sample of $k$ independent observations
from a complex $p$-variate normal distribution. However, the distribution of the largest eigenvalue
of the matrix $S_2 S_1^{-1}$ was not derived in Goodman [5]. Khatri's [3] results, which deal with real
normal random vectors, may be generalized to the complex case. In order to extend the results on the
density function of the largest eigenvalue of the matrix $S_2 S_1^{-1}$ given in Khatri [3], we
need to prove lemmas similar to those in Khatri [3] for the complex case by using the
hypergeometric functions in matrix argument for the complex case. Thus we need to define
the hypergeometric functions ${}_pF_q$ in matrix argument for the complex case appropriately. James
[4] has some definitions and some properties of ${}_pF_q$ in matrix argument for the complex case. But
it is clear that not all of them are useful in our research, and in our research we need more
properties than are given in James [4].

As can be seen from the description of the two methods, the verification of (2.18) is interesting
and important for both theoreticians and practitioners. The author proposes to study these two
methods in follow-up research.

3. Invariance and monotonicity properties

Invariance is a mathematical property for symmetry, and in practice many statistical testing
problems exhibit symmetries. It is a widely accepted principle that
if a hypothesis under consideration is invariant under a transformation, then a good test statistic or
critical region should also be invariant under the same transformation. The reason for the
intuitive appeal of an invariant test procedure is the desire for a unique best way to analyze a
collection of data. A well-known invariant test in multivariate statistical analysis (see, for
example, Anderson [1], page 181) is the Hotelling $T^2$ test for testing the mean vector of a normal
random vector. It is uniformly most powerful among all tests of $\mu$ (mean vector) $= 0$ based on the
sufficient statistics $(\bar{x},\, A = \sum_\alpha (x_\alpha - \bar{x})(x_\alpha - \bar{x})')$ that are invariant with
respect to transformations $\bar{x}^* = C\bar{x}$, $A^* = CAC'$, where $C$ is a non-singular matrix. For a general
treatment of invariance, see Lehmann [6]. For the invariance properties of some testing procedures in
multivariate statistical analysis, refer to Anderson [1]. These texts do not deal with the exact problem
that is considered in this research. However, in the following we will review the background of
invariance and apply the invariance idea to our problem.

Definition 3.1:

(1) A function T(x) defined on the sample space X is invariant under the group of transformations
G if T(x) = T(gx) for all x in X and g in G.

(2) An invariant function is a maximal invariant under the group of transformations G if, for x, y
in X, T(x) = T(y) implies that there exists a g in G such that y = gx.

Our problem of testing $H_0: R_1 = R_2$ versus $H_1: R_1 \ne R_2$ is invariant under the group of
transformations $G$ of $p \times p$ nonsingular complex matrices which transform $x$ to $gx$ for $g$ in $G$, since
for $X \sim CN_p(0, R)$, $CX \sim CN_p(0, CRC^H)$, which is in the same class of distributions as $X$, and
$gR_1g^H = gR_2g^H$ implies $R_1 = R_2$. We are now ready to show that the set of eigenvalues of $R_2 R_1^{-1}$
is a maximal invariant under the group of transformations $G$.
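
The invariance claimed here can be checked numerically. The sketch below uses real $2 \times 2$ matrices for simplicity (an assumption; in the complex case the transpose would be a conjugate transpose) and compares the eigenvalues of $R_2 R_1^{-1}$ before and after the transformation $R_i \to g R_i g'$. The helper names and the example matrices are hypothetical.

```python
import math

def matmul(a, b):
    """Product of two 2 x 2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def transpose(a):
    return [[a[j][i] for j in range(2)] for i in range(2)]

def inverse(a):
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [[a[1][1] / det, -a[0][1] / det],
            [-a[1][0] / det, a[0][0] / det]]

def eigenvalues(m):
    """Eigenvalues of a 2 x 2 matrix with real eigenvalues, largest first."""
    half_trace = (m[0][0] + m[1][1]) / 2.0
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    disc = math.sqrt(half_trace ** 2 - det)
    return (half_trace + disc, half_trace - disc)

def roots(r1, r2):
    """Eigenvalues of R2 R1^{-1}."""
    return eigenvalues(matmul(r2, inverse(r1)))

def transform(g, r):
    """Covariance of g X when X has covariance r (real case): g r g'."""
    return matmul(matmul(g, r), transpose(g))

R1 = [[2.0, 1.0], [1.0, 3.0]]
R2 = [[4.0, 1.0], [1.0, 2.0]]
G = [[1.0, 2.0], [0.5, -1.0]]     # any nonsingular matrix
before = roots(R1, R2)
after = roots(transform(G, R1), transform(G, R2))
```

The two root pairs agree up to rounding because $(gR_2g')(gR_1g')^{-1} = g\,(R_2R_1^{-1})\,g^{-1}$ is similar to $R_2R_1^{-1}$.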

Theorem 3.1: A set of maximal invariants under the group $G$ comprises the eigenvalues of
$R_2 R_1^{-1}$, that is, the roots of

(3.1)  $|R_2 R_1^{-1} - \lambda I| = 0$.

Proof: Since $|g(R_2 R_1^{-1} - \lambda I)g^{-1}| = |R_2 R_1^{-1} - \lambda I|$, the roots of equation (3.1) are
invariant under $G$.

To show that they are maximal invariant, we suppose that $|R_2 R_1^{-1} - \lambda I| = 0$ and
$|B - \lambda I| = 0$ have the same roots, where $B$ is a Hermitian positive definite matrix. We want to show
that there exists a $g$ in $G$ such that $B = g(R_2 R_1^{-1})g^H$.

Since both $B$ and $R_2 R_1^{-1}$ are Hermitian and positive definite, there exist unitary matrices
$g_1, g_2 \in G$ such that

$g_1 (R_2 R_1^{-1}) g_1^H = \Lambda = g_2 B g_2^H$

where $\Lambda$ is a diagonal matrix whose elements are the roots of the equation (3.1). Thus we get
$B = g_2^H g_1 (R_2 R_1^{-1}) g_1^H g_2$, where $g = g_2^H g_1 \in G$.

Similarly, the maximal invariants in the sample space are the roots of

(3.2)  $|S_2 S_1^{-1} - fI| = 0$, or equivalently $|S_2 - f S_1| = 0$.

Any invariant test statistic or invariant critical region can be expressed in terms of the roots of the
equation (3.2). The null hypothesis is rejected if the smaller roots are too small or if the larger
roots are too large, or both. In terms of the eigenvalues $\lambda_1, \ldots, \lambda_p$, the null hypothesis can be
written as

(3.3)  $\lambda_1 = \lambda_2 = \cdots = \lambda_p = 1$.

Any useful invariant test of $H_0$ has a rejection region in the space of $f_1 \le f_2 \le \cdots \le f_p$ that
includes the points that in some sense are far from $f_1 = f_2 = \cdots = f_p = 1$. The power of an invariant
test depends on the parameters through the roots $\lambda_1, \ldots, \lambda_p$. It is clear from (2.7) that the
critical region of our test depends only on the trace of $S_2 S_1^{-1}$, which is equivalent to the largest
eigenvalue in our case. Thus it is an invariant test. Therefore, the power function should depend on the
parameters only through the roots $\lambda_1, \ldots, \lambda_p$.

We now discuss another interesting property of the test. It is based on the following theorem
and corollary of Anderson and Das Gupta [7].

Consider two independent random matrices $U_1$ of dimension $p \times n_1$ and $U_2$ of dimension
$p \times n_2$ such that the column vectors of $U_1$ are independently and normally distributed with mean
0 and covariance matrix $\Sigma_1$ and the column vectors of $U_2$ are independently and normally
distributed with mean 0 and covariance matrix $\Sigma_2$. Then we have

Theorem 3.2 (due to Anderson and Das Gupta [7]):

Let $\omega$ be a set in the space of $r_1, r_2, \ldots, r_p$, the characteristic roots of
$(U_1 U_1')(U_2 U_2')^{-1}$, such that when a point $(r_1, r_2, \ldots, r_p) \in \omega$, so is every point
$(\bar{r}_1, \bar{r}_2, \ldots, \bar{r}_p)$ for which $\bar{r}_i \le r_i$, $i = 1, \ldots, p$.
Then the probability of the set $\omega$ depends on $\Sigma_1$ and $\Sigma_2$ only through
$(\lambda_1, \ldots, \lambda_p)$ and is a monotonically decreasing function of each $\lambda_i$.

Corollary 3.3 (due to Anderson and Das Gupta [7]):

If an invariant test with respect to $G$ has an acceptance region $\omega'$ such that if
$(r_1, r_2, \ldots, r_p) \in \omega'$, so is $(\bar{r}_1, \bar{r}_2, \ldots, \bar{r}_p)$ for which
$\bar{r}_i \le r_i$, $i = 1, \ldots, p$, then the power function of the test is a
monotonically increasing function of each $\lambda_i$.

The above theorem and corollary were shown for the case of real multivariate normal
populations. To the author's best knowledge, the results have not yet been shown for the complex
multivariate normal case. However, it is conjectured that both the theorem and the corollary
are still true for the complex case.

Since our test is an invariant test from Theorem 3.1, the power function should have the
monotonicity property as described in the corollary of Theorem 3.2.

4. Concluding remarks

Let  X;(/»x/t^)  and  X^ipxn^)  be  independent  matrix  variates,  columns  of  X,  being 
independently  distributed  as  CN{0,Ri)  and  those  of  Xj  independently  distributed  as  CN{0, R2) . 
In  testing  the  equality  of  Rj  and  R^,  Various  test  criteria  have  been  proposed  and  studied  in  the 
literature.  Let  f,<f2<..<f„  be  the  eigen  values  of  |X;X';  -  0  and  >  ^2  >...> 

be  the  eigen  values  of  -  /Rj|=  0.  These  test  criteria  can  be  summarized  as  follows: 

(1) Wilks criterion (Wilks [8]): $W = \prod_{i=1}^{p} (1 + f_i)^{-1}$

(2) Roy's largest root (Roy [9]): $f_p$

(3) Hotelling's trace (Hotelling [10]): $U = \sum_{i=1}^{p} f_i$

(4) Pillai's trace (Pillai [11]): $V = \sum_{i=1}^{p} f_i / (1 + f_i)$

(5) Modified likelihood ratio (see Anderson [1]): $\lambda^* = \dfrac{|X_1 X_1'|^{n_1/2}\,|X_2 X_2'|^{n_2/2}}{|X_1 X_1' + X_2 X_2'|^{(n_1+n_2)/2}}$

(6)  Nagao's  criterion  (Nagao  [12]),  W  =  — ' — ^(X;X^  h-X^Xj)'^ -I]^  . 

2  i=l 

It is clear that tests (1) to (4) are all functions of the roots of $|X_1 X_1' - f X_2 X_2'| = 0$. So by Theorem 3.1 in the last section, they are all invariant tests. It can also be shown that test criteria (5) and (6) are invariant under the group G of transformations defined in Section 3 (see page 408, Anderson [1]). It is readily verified that tests (1) to (4) satisfy the conditions of Theorem 3.2. Thus they all possess the monotonicity property for their respective power functions. It is not trivial that test (5) satisfies the condition required by Theorem 3.2. But test (5) certainly has its own attractive properties as described in Anderson [1]. Chu and Pillai [13] made some numerical power comparisons for tests (1) to (5) for the regular case only, that is, for both $n_1, n_2 > p$.
Their  results,  however,  are  for  the  real  case  only.  Due  to  the  complications  of  the  computation, 
their  comparisons  are  limited  to  p  =  2  and  3.  Even  for  the  limited  cases  considered  by  them,  they 
made  no  general  conclusion  on  which  test  is  uniformly  better  than  the  others.  So  far  no  attempt  has 
been  made  by  anyone  to  compare  the  powers  of  the  test  criterion  (6)  to  any  other  criterion  on 
testing  the  equality  of  covariance  matrices.  As  we  have  mentioned  earlier  in  Section  1,  test  (5)  is 
inappropriate  for  the  singular  case  that  we  are  considering.  The  motivation  behind  Nagao's  test  (6) 
can  be  briefly  summarized  as  follows.  Nagao  [12]  showed  that  the  variance  of  the  limiting 
distribution of $-2\log\lambda$ ($\lambda$ = likelihood ratio criterion) can be expressed as
(4.1)  =  i(/i,. /n)tr[R,in,R2+n2R:y' -If 

i=; 

Thus for testing the equality of two covariance matrices $R_1, R_2$, we replace the population covariance matrices in (4.1) by the sample covariance matrices and obtain the test criterion (6).
From the above discussion, only test (5) is not appropriate for the singular case. Thus comparing tests (1), (2), (3), (4), and (6) in the singular case, for the complex case as well as for the real case, both theoretically and numerically, could be an important and challenging task.

As for our problem described in Section 2, $n_1 = 1$. Thus $f_1 = f_2 = \cdots = f_{p-1} = 0 < f_p$, and the Wilks test statistic is $W = 1/(1 + f_p)$, whose critical region is equivalent to that of our test T. Roy's test statistic is exactly the same as our test statistic T. Hotelling's test statistic, in our case, is $U = \sum_{i=1}^{p} f_i = f_p$, which is again exactly the same as our T. Pillai's test statistic can be rewritten, in our case, as $V = \sum_{i=1}^{p} f_i/(1+f_i) = f_p/(1+f_p) = 1 - 1/(1+f_p) = 1 - W$. It is also equivalent to our test T. Thus all the tests (1) to (4) are either exactly the same as our test T or can be considered equivalent to our test T. Test (6) is derived from the limiting distribution of test (5), whose original form cannot be used in the singular case. Thus it is meaningful to compare the performances of our test T and test (6) in both the real and complex cases.
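The reduction of criteria (1) to (4) to functions of the single nonzero root $f_p$ can be checked numerically. The following is a minimal sketch, not the author's computation: the sizes `p` and `n2`, the random seed, and the use of the conjugate transpose for the complex case are all assumptions made for illustration, and `n2 >= p` is assumed so that $X_2 X_2'$ is invertible.

```python
import numpy as np

rng = np.random.default_rng(0)
p, n1, n2 = 4, 1, 12          # hypothetical sizes; n1 = 1 as in Section 2


def complex_normal(rows, cols):
    """Draw a rows x cols matrix of standard complex normal entries."""
    real = rng.standard_normal((rows, cols))
    imag = rng.standard_normal((rows, cols))
    return (real + 1j * imag) / np.sqrt(2.0)


X1 = complex_normal(p, n1)
X2 = complex_normal(p, n2)
A1 = X1 @ X1.conj().T          # rank one because n1 = 1
A2 = X2 @ X2.conj().T          # nonsingular almost surely because n2 >= p

# The roots of |A1 - f A2| = 0 are the eigenvalues of A2^{-1} A1.
f = np.sort(np.linalg.eigvals(np.linalg.solve(A2, A1)).real)
f_p = f[-1]

W = 1.0 / np.prod(1.0 + f)     # Wilks criterion
U = np.sum(f)                  # Hotelling's trace
V = np.sum(f / (1.0 + f))      # Pillai's trace

print("roots:", f)
print("W and 1/(1+f_p):", W, 1.0 / (1.0 + f_p))
print("U and f_p:", U, f_p)
print("V and 1-W:", V, 1.0 - W)
```

Each criterion is a monotone function of $f_p$ here, which is why the four tests share the same critical region in this singular case.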
ABSTRACT

In  this  work  we  analyze  a  detector  on  the  basis  of  a  model  whereupon  each  signal  component 
consists  of  the  radiant  thermal  energy  contributed  by  the  emissivity  and  the  blackbody  radiation  reference 
(BBRR)  of  either  the  small  low-observation  target  or  the  intense,  highly  structured  background.  A  low- 
order  power  series  is  employed  to  approximate  the  BBRR.  A  sufficient  statistic,  when  the  underlying 
sensor  noise  and  spectral  emissivities  are  normally  distributed,  is  derived.  It  is  shown  that  the  new 
detector  is  effective  in  enhancing  the  target  and  suppressing  cluttered  background  on  a  pixel-by-pixel 
basis.  It  is  also  shown  that  under  more  regular  conditions,  the  new  detector  represents  other  classes  of 
detectors,  including  the  constant  emissivity  detector  and  the  maximum  a  posteriori  type  detector.  This 
suggests  that  the  new  class  of  detectors  performs  at  least  as  well  as  other  classes  of  detectors  and  yields 
better  performance  when  some  restrictive  conditions  are  relaxed.  Numerical  computations  demonstrate 
the  superior  performance  of  the  proposed  detector. 
I.  INTRODUCTION

The detection of a small, low-intensity target in high-intensity clutter is a difficult task of growing importance owing to numerous applications, including geological remote sensing, mineralogic exploration, environmental monitoring, and ground/airborne target identification.
These  applications  are  characterized  by  spatial  resolution  sufficient  to  barely  resolve  the  target  imbedded 
in  intense,  highly  structured  background  clutter.  Classical  spatial  filtering  techniques  of  single  band 
infrared  (IR)  imagery  do  not  provide  high  probability  of  detection  with  a  low  false  alarm  rate.  One  way 
to  achieve  satisfactory  results  for  these  demanding  cases  is  to  apply  multi-spectral  signal  processing 
techniques  which  exploit  spectral  differences  between  the  target  signature  and  the  background.  In  fact, 
several recent studies [1]-[4] indicate that on the basis of multispectral emissivity differences between the
target  and  the  background,  the  detectability  of  small  man-made  objects  viewed  in  a  natural  background 
clutter  may  be  improved  dramatically.  Research  work  in  this  area  has  been  stimulated  primarily  by  the 
development  of  imaging  sensors  capable  of  collecting  high-quality  spectral  data  in  the  thermal  IR  region, 
such  as  the  NASA  Thermal  Infrared  Multispectral  sensor  (TIMS)  and  the  DARPA-sponsored 
Multispectral  Infrared  Camera  (MUSIC). 

However, the effective radiance incident upon the detectors of the sensors, after it has been compensated for noise effects, is, for any particular bandwidth, a nonlinear function of the emissivity and temperature of the emitting target/background. The sensor system simply cannot separate the contributions of emissivity and temperature that constitute the radiance value. Since detection has
to  be  made  based  on  spectral  radiances,  it  becomes  necessary  to  accurately  specify  temperatures  of  the 
targets  and  backgrounds  in  order  to  characterize  the  emissivities  properly.  One  approach  is  to  assume 
that  the  temperatures  of  the  target  and  the  background  are  the  same,  so  that  the  difference  of  the  scene 
data  in  one  band  with  respect  to  the  scene  data  in  the  other  spectral  band,  on  a  pixel-by-pixel  basis,  is 
the  target  residual  in  the  output  image  because  of  the  coloring  between  spectral  bands  [5], [6]. 

In  real  applications,  temperature  of  the  target  (e.g.,  engine  plume)  may  be  drastically  different 
from  the  temperature  of  the  background,  or  the  temperature  may  deviate  from  information  at  hand.  Also, 
sensor  noise  may  be  correlated  for  various  color  bands.  Furthermore,  prior  knowledge  regarding 
emissivities  of  the  target  and  the  background  may  be  very  limited.  However,  if  the  parameters  for  the 
target  and  background  used  to  generate  the  radiance  statistics  for  detection  are  not  the  same  as 
encountered  in  the  real  world,  the  detector  may  suffer  enormous  losses.  To  overcome  these  problems, 
a  new  approach  is  presented  in  this  paper. 
Section II introduces a radiance signal model including the effects of both temperature and emissivity for the target and the background. This is followed by the approximation of $L_i(T)$, the sensor response to the blackbody radiance reference. Section III presents a derivation of a sufficient statistic
of  an  optimal  likelihood  ratio  detector  based  on  the  assumption  of  a  Gaussian  distribution  for  sensor  noise 
and  emissivity.  Section  IV  indicates  the  generality  of  the  proposed  detector  and  some  of  its  relevant 
properties.  Section  V  compares  performance  of  the  proposed  detector  relative  to  others  via  numerical 
computations.  Further  suggestions  and  conclusions  are  given  in  Section  VI. 


II.  PRELIMINARIES 

Let the $N$-variate $x = [x_1 \cdots x_N]'$ be the multiband radiant energy observable to a multispectral sensor with $N$ channels. Each component of the observation, $x_i = x(\lambda_i)$, is an additive sum of radiant energy $s_i(e,T)$ and sensor (channel) noise $w_i$ for a specific wavelength region $(\lambda_i - \tfrac12\Delta\lambda_i,\ \lambda_i + \tfrac12\Delta\lambda_i]$.

The radiant energy component $s_i(e,T)$ is generated by integrating the blackbody spectral radiation from a thermally emitting source attenuated by the spectral responsivity of an infrared (IR) filter over its spectral bandwidth, and multiplying the result by the emissivity of the source (random emissivity of the source over $(\lambda_i - \tfrac12\Delta\lambda_i,\ \lambda_i + \tfrac12\Delta\lambda_i]$), i.e., $s_i(e,T) = e_i L_i(T)$, where $e_i = e(u \mid u \in (\lambda_i - \tfrac12\Delta\lambda_i,\ \lambda_i + \tfrac12\Delta\lambda_i])$,

$$L_i(T) = \int_{\lambda_i - \frac12\Delta\lambda_i}^{\lambda_i + \frac12\Delta\lambda_i} r_i(\lambda)\, L(\lambda,T)\, d\lambda \qquad (1)$$

and $r_i(\lambda)$ is the spectral response, or the so-called point spread function (psf), of the $i$th sensor characterized by the center wavelength $\lambda_i$ and bandwidth $\Delta\lambda_i$. $L(\lambda,T)$ is Planck's function of the blackbody radiation

$$L(\lambda,T) = c_1 \lambda^{-5} \left(\exp(c_2/\lambda T) - 1\right)^{-1} \qquad (2)$$

with $c_1 = 3.7415\times10^{4}$ W cm$^{-2}$ $\mu$m$^{4}$ and $c_2 = 1.43879\times10^{4}$ $\mu$m K [7]. Therefore the $N$-variate observable is

$$x = s(e,T) + w = \begin{bmatrix} s_1(e,T)\\ \vdots\\ s_N(e,T)\end{bmatrix} + \begin{bmatrix} w_1\\ \vdots\\ w_N\end{bmatrix}, \qquad s_i(e,T) = e_i L_i(T) \qquad (3)$$

where bold letters represent vectors, underlined bold letters represent matrices, and the prime denotes transposition. As is evident from (2), the blackbody IR filter output $L_i(T)$ is nonlinear in $T$. This poses many theoretical and implementation problems. However, because the temperature range is usually known, we may approximate it by a power series. This is addressed by the theorem below.
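To make the signal model concrete, the band response in (1) can be evaluated by numerical quadrature of Planck's function (2). The sketch below is only illustrative: it assumes an ideal box filter ($r_i(\lambda) = 1$ inside the band), reads the constants as $c_1 = 3.7415\times10^{4}$ W cm$^{-2}$ $\mu$m$^{4}$ and $c_2 = 1.43879\times10^{4}$ $\mu$m K, and uses a hypothetical 10 $\mu$m band of width 1 $\mu$m and a hypothetical emissivity of 0.9.

```python
import numpy as np

C1 = 3.7415e4    # W cm^-2 um^4; value quoted in the text, exponent and units assumed
C2 = 1.43879e4   # um K; value quoted in the text, exponent and units assumed


def planck(lam_um, temp_k):
    """Planck's function (2) at wavelength lam_um [um] and temperature temp_k [K]."""
    lam_um = np.asarray(lam_um, dtype=float)
    return C1 * lam_um ** -5 / np.expm1(C2 / (lam_um * temp_k))


def band_response(center_um, width_um, temp_k, n=2001):
    """Approximate L_i(T) in (1), assuming an ideal box filter r_i = 1 in the band."""
    lam = np.linspace(center_um - 0.5 * width_um, center_um + 0.5 * width_um, n)
    y = planck(lam, temp_k)
    # Composite trapezoid rule over the band.
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(lam)))


def radiance_signal(emissivity, center_um, width_um, temp_k):
    """Signal component s_i(e, T) = e_i * L_i(T)."""
    return emissivity * band_response(center_um, width_um, temp_k)


for temp in (280.0, 300.0, 320.0):
    print(temp, radiance_signal(0.9, 10.0, 1.0, temp))
```

The printed values grow faster than linearly with temperature, which is the nonlinearity the power-series approximation is meant to handle.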
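Theorem I. Suppose $\int_{T_0 - \frac12\Delta T}^{T_0 + \frac12\Delta T} L_i^2(T)\, dT < \infty$; then within the interval $(T_0 - \tfrac12\Delta T,\ T_0 + \tfrac12\Delta T]$, $L_i(T)$ may be approximated arbitrarily closely by a power series of finite order

$$L_i(T) = a_{i0} + \sum_{m=1}^{M} a_{im} T^m, \qquad T \in (T_0 - \tfrac12\Delta T,\ T_0 + \tfrac12\Delta T] \qquad (4)$$

and the approximation is in the integral square error (ISE) sense.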


In addition, for reasons that will become apparent, this approach requires only a very low-order power series and incurs a negligible error. This observation is confirmed in [8,9], which considered the approximation of $L_i(T)$ using a second-order power series. The uniform convergence of the power series approximation implies convergence in the mean square sense and with probability 1 [11]. Consequently, we may assume the blackbody filter output $L_i(T)$ is known (almost) exactly. The estimation of the temperature coefficient vector $a_i = [a_{i0} \cdots a_{iM}]'$ will be considered later.
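The claim that a second-order power series suffices can be illustrated with a least-squares fit, which on a uniform temperature grid approximates the ISE-optimal polynomial. This is a sketch under assumptions, not the procedure of [8,9]: the band (10 $\mu$m center, 1 $\mu$m width, box filter), the temperature interval (280 K to 320 K), and the centering of the polynomial at `T0` for numerical conditioning are all choices made here for illustration. A polynomial in $T - T_0$ is a polynomial of the same order in $T$.

```python
import numpy as np
from numpy.polynomial import polynomial as P

C1 = 3.7415e4    # W cm^-2 um^4 (exponent and units assumed)
C2 = 1.43879e4   # um K (exponent and units assumed)


def band_response(center_um, width_um, temp_k, n=2001):
    """Band-integrated Planck function L_i(T), assuming a box filter."""
    lam = np.linspace(center_um - 0.5 * width_um, center_um + 0.5 * width_um, n)
    y = C1 * lam ** -5 / np.expm1(C2 / (lam * temp_k))
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(lam)))


T0, dT = 300.0, 40.0                       # hypothetical temperature interval
temps = np.linspace(T0 - 0.5 * dT, T0 + 0.5 * dT, 81)
L = np.array([band_response(10.0, 1.0, t) for t in temps])

# Second-order fit in (T - T0); coef holds the three fitted coefficients.
coef = P.polyfit(temps - T0, L, 2)
fit = P.polyval(temps - T0, coef)

# Relative integral square error on the grid.
rel_ise = float(np.sum((L - fit) ** 2) / np.sum(L ** 2))
print("coefficients:", coef)
print("relative ISE:", rel_ise)
```

A small relative ISE over the whole interval is what justifies treating $L_i(T)$ as known once the coefficient vector has been estimated.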


When there is no target, the signal components (thermal radiant energy) $s_i(e,T)$ depend solely on the naturally occurring backgrounds, which may be composed of natural scenes such as water, soil, sand, etc. This is indicated by the subscripted letter b (background). In the presence of a target that fills an entire pixel, the signal components depend solely on the multispectral emissivity of the target and the corresponding blackbody filter response. This is marked by a subscripted letter t (target).

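The detection of the target imbedded in background clutter is our main concern and can be formulated as a composite hypothesis testing problem

$$H_0:\ x = \begin{bmatrix} s_{b,1}(e,T)\\ \vdots\\ s_{b,N}(e,T)\end{bmatrix} + \begin{bmatrix} w_1\\ \vdots\\ w_N\end{bmatrix}, \qquad s_{b,i}(e,T) = e_{b,i} L_i(T_b),\quad i = 1, \ldots, N$$

$$H_1:\ x = \begin{bmatrix} s_{t,1}(e,T)\\ \vdots\\ s_{t,N}(e,T)\end{bmatrix} + \begin{bmatrix} w_1\\ \vdots\\ w_N\end{bmatrix}, \qquad s_{t,i}(e,T) = e_{t,i} L_i(T_t),\quad i = 1, \ldots, N. \qquad (5)$$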


It is well known that the Neyman-Pearson test statistic for the above model is [10]

$$\mathcal{L}(x) = \frac{f(x \mid H_1)}{f(x \mid H_0)} \underset{H_0}{\overset{H_1}{\gtrless}} \tau \qquad (6)$$

where $\tau$ is the threshold for a prespecified type-I error $\alpha$, and $f(\cdot)$ is the probability density function (pdf) of $x$. Evidently, the specific form of the test statistic $\mathcal{L}(\cdot)$ is a function of the probability distributions of the sensor noise field $w = [w_1 \cdots w_N]'$, the multispectral emissivity fields $e_\gamma$, and the temperatures $T_\gamma$, $\gamma = b$ (background clutter), $t$ (target), respectively.

III.  OPTIMAL  MULTISPECTRAL  DETECTORS 

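For general development, the following assumptions are made:

i) The blackbody radiation flux $L(T_\gamma) = [L_1(T_\gamma) \cdots L_N(T_\gamma)]'$, $\gamma = b, t$, is known or can be approximated reasonably closely;

ii) The multispectral emissivity field $e_\gamma = [e_\gamma(\lambda_1) \cdots e_\gamma(\lambda_N)]' = [e_{\gamma,1} \cdots e_{\gamma,N}]'$ is normally distributed $N(\mu_{e_\gamma}, \Sigma_{e_\gamma})$, i.e.,

$$f(y) = \left((2\pi)^N |\Sigma_{e_\gamma}|\right)^{-1/2} \exp\left\{-\tfrac12 (y - \mu_{e_\gamma})' \Sigma_{e_\gamma}^{-1} (y - \mu_{e_\gamma})\right\}, \qquad \gamma = b, t, \qquad (7)$$

where $|\cdot|$ denotes the determinant;

iii) The sensor noise field $w = [w_1 \cdots w_N]'$ is also normally distributed $N(\mu_w, \Sigma_w)$ and is statistically independent of the emissivity field.

Combining i), ii), and eq. (3) we get

iv) The multispectral signal field $s_\gamma = [s_{\gamma,1} \cdots s_{\gamma,N}]'$ is normally distributed $N(\mu_{s_\gamma}, \Sigma_{s_\gamma})$, with the mean vector $\mu_{s_\gamma} = [\mu_{s_\gamma,1} \cdots \mu_{s_\gamma,N}]'$, $\mu_{s_\gamma,i} = E\{e_{\gamma,i} L_i(T_\gamma)\} = \mu_{e_\gamma,i} L_i(T_\gamma)$, and the covariance matrix $\Sigma_{s_\gamma} = \{\mathrm{cov}(s_{\gamma,i}, s_{\gamma,j})\}$, each element of which is

$$\mathrm{cov}(s_{\gamma,i}, s_{\gamma,j}) = E\{(s_{\gamma,i} - \mu_{s_\gamma,i})(s_{\gamma,j} - \mu_{s_\gamma,j})\} = \mathrm{cov}(e_{\gamma,i}, e_{\gamma,j})\, L_i(T_\gamma) L_j(T_\gamma), \qquad i,j = 1, \ldots, N;\ \gamma = b, t. \qquad (8)$$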


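The probability densities of the observation vector $x$ conditional on $H_j$ are

$$f(x \mid H_j) = \int \left((2\pi)^{2N} |\Sigma_w|\,|\Sigma_{s_\gamma}|\right)^{-1/2} \exp\left\{-\tfrac12 (x - s_\gamma - \mu_w)' \Sigma_w^{-1} (x - s_\gamma - \mu_w)\right\} \exp\left\{-\tfrac12 (s_\gamma - \mu_{s_\gamma})' \Sigma_{s_\gamma}^{-1} (s_\gamma - \mu_{s_\gamma})\right\} ds_\gamma, \qquad \gamma = b \text{ for } H_0,\ t \text{ for } H_1. \qquad (9)$$

Since the covariance matrix $\Sigma_w$ is symmetric, so is its inverse $\Sigma_w^{-1}$, and, together with the identity $(ABC)' = C'B'A'$, we have

$$(x - s_\gamma - \mu_w)' \Sigma_w^{-1} (x - s_\gamma - \mu_w) = (x - \mu_w)' \Sigma_w^{-1} (x - \mu_w) - 2 s_\gamma' \Sigma_w^{-1} (x - \mu_w) + s_\gamma' \Sigma_w^{-1} s_\gamma,$$

$$(s_\gamma - \mu_{s_\gamma})' \Sigma_{s_\gamma}^{-1} (s_\gamma - \mu_{s_\gamma}) = s_\gamma' \Sigma_{s_\gamma}^{-1} s_\gamma - 2 s_\gamma' \Sigma_{s_\gamma}^{-1} \mu_{s_\gamma} + \mu_{s_\gamma}' \Sigma_{s_\gamma}^{-1} \mu_{s_\gamma}.$$

Summing the above two equations, we obtain

$$Q(x, s_\gamma) = (x - \mu_w)' \Sigma_w^{-1} (x - \mu_w) + s_\gamma' (\Sigma_w^{-1} + \Sigma_{s_\gamma}^{-1}) s_\gamma - 2 s_\gamma' \{\Sigma_w^{-1}(x - \mu_w) + \Sigma_{s_\gamma}^{-1} \mu_{s_\gamma}\} + \mu_{s_\gamma}' \Sigma_{s_\gamma}^{-1} \mu_{s_\gamma}. \qquad (10)$$

Substituting (9) into (6), the optimum detector is

$$\mathcal{L}(x) = \frac{K_t \int \exp\{-\tfrac12 Q(x, s_t)\}\, ds_t}{K_b \int \exp\{-\tfrac12 Q(x, s_b)\}\, ds_b} \underset{H_0}{\overset{H_1}{\gtrless}} \tau \qquad (11)$$

with the normalizing constant $K_\gamma = \left((2\pi)^{2N} |\Sigma_w|\,|\Sigma_{s_\gamma}|\right)^{-1/2}$, $\gamma = b, t$, and $Q(x, s_\gamma)$ defined in (10).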

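We proceed to evaluate each of the multi-dimensional integrals.

Since $\Sigma_w$ and $\Sigma_{s_\gamma}$ are covariance matrices, they are symmetric and positive definite [11],[12]. These properties extend to their inverses and to the sum of their inverses. There exist two sets of $N \times 1$ orthonormal eigenvectors $\{v_{\gamma,i}\}$ and corresponding eigenvalues $\{\lambda_{\gamma,i}^{-1}\}$ such that [13]

$$(\Sigma_w^{-1} + \Sigma_{s_\gamma}^{-1})\, v_{\gamma,i} = \lambda_{\gamma,i}^{-1} v_{\gamma,i}, \qquad v_{\gamma,i}' v_{\gamma,j} = \delta_{ij}, \qquad \delta_{ij} = \text{Kronecker delta}, \quad \gamma = b, t.$$

Form the matrices $V_\gamma = [v_{\gamma,1} \cdots v_{\gamma,N}]$, $\Lambda_\gamma = \mathrm{diag}[\lambda_{\gamma,1}, \ldots, \lambda_{\gamma,N}]$; then it is apparent that

$$(\Sigma_w^{-1} + \Sigma_{s_\gamma}^{-1})\, V_\gamma = V_\gamma \Lambda_\gamma^{-1}, \qquad (12)$$

$$V_\gamma V_\gamma' = V_\gamma' V_\gamma = I, \qquad \gamma = b, t. \qquad (13)$$

Multiplying (12) by $V_\gamma'$ and applying (13) yields $V_\gamma' (\Sigma_w^{-1} + \Sigma_{s_\gamma}^{-1}) V_\gamma = \Lambda_\gamma^{-1}$. Making use of the identity $(ABC)^{-1} = C^{-1} B^{-1} A^{-1}$, the inverse is

$$V_\gamma' (\Sigma_w^{-1} + \Sigma_{s_\gamma}^{-1})^{-1} V_\gamma = \Lambda_\gamma. \qquad (14)$$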

The  symmetry  and  positive  definiteness  of  implies  that  elements  of  A"‘  are  all  real 

positive  numbers,  and,  therefore,  its  square  root  A"’"^  exists.  Apply  a  linear  operator  =1^  A"’'^,  then 
X^=IJ!^X,  =  H,  =  I/'  11^ ,  W^  =  IJ!^W,  =  C/'  •  Equivalently,  since  1^  =  A]^  V'*  =  A]^  V' , 

we  get  AJl^  X  ,  =  V  A^  f,  =  V  A^  il,,  w  =  w^,  = 

Rewrite  (10)  as 

Q(x,s^)  =  (X  -  mJ'  ^ 

+  W  A'^s-  2<  A'-^V'/V-'V V(x  -  A’V,\ 

y-y  -y\  — ry-y  y  ^  -y  -y\—^  -y-y  ^  “y“y  M 


where 


^-1  =  A’'^ A'-^,  =  A‘^ y'^-'y  a’'^. 

— ““y  “’y"”^  ””y  ^y  y  y  y  y 


(15) 
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We  define  “M*  )  ‘‘'SirVj  and  complete  the  matrix  squares  to  obtain 


Q(x,s^)  =  II  ^  -  A  IP  (X  -  At,/  jr;  (X  -  At  J  + 

-  -  H)'  Kl  K]  M J  -  m;, ±;;  x;  -  2  (/  -  At,/  i:;;  i:;;  At, 

=  II  f  -i  IP  +  (x  -At,/ (i:;'  -3::;^)(x 
-2(x  -At,)'!:;'!-*  At,. 


(16) 


By  the  identity  |  \  =  |A|  |B|  | C| ,  the  determinant  of  3^.  is 


y  y  — — y  — 


=  A-'^^V'^  VA-''^ 

y  y  Sy'~^y  y 


rii 


£  . 


(17) 


Applying  (16)  while  taking  into  account  the  Jacobian  tfs  =  I  V  A'^  Ids  =  |A  I'^ds  1141  each  of 

y  y  "“7  Y  *■  ^ 

the  integrals  (11)  is  evaluated 

\  -\  ((27r)^|£^|  |£J  pexpj-  •AG^3,(i  )| \A\'^ds^ 


=  ((27r)'^|£^|  j  '^exp  |  -  'A 


(^  -M,/  (£;■  -£;^)(x  -AtJ  +At((£p  -£/)/t, 

y  y  y  '  \  y 

-  2  (X  -  At,/  £;*  £;'  A*,  ] },  y=b,  t. 


(18) 


The  last  line  is  due  to  (17)  and  the  identity  |  -  j  ((27r)'^|£^  |  exp{  -  ||  f  -i  =  i . 

Taking the logarithm of (18), transferring all normalizing constants to the r.h.s., and simplifying,
a sufficient statistic for the likelihood ratio detector in the form of (6) or (11) is

^(x)  =  !2(x„At,  )  -  Q(je„At,  )  >  <r  (19) 

‘  '  «o 

where 

ec  )  -  (X,  -  /  (♦;;-»;=  )(x  -  -  ♦;> ) -  2  (f^  -  „ J'  j-i  i,^  . 

(20) 

In  general,  it  is  preferable  to  work  in  the  original  W  space.  Therefore,  apply  (12)-(14)  together 
with  the  transformed  vectors  listed  right  after  (14)  to  obtain 

(^  -  M,/  £;'  (X  -  At, )  =  (X  -  nj  V  A-'^^A^V^-^V  A'^A-'^V'(x  -  At  ) 

T  Y  Y  Y“Y - w—v— v— Y  — Y 


y  y  y' 

=  0c-nJ'X!(x-nJ 
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=  (.X  - ilJ  V  a;^  a;" a;^  )  ( ~ 

=  (x-  nj'  X'  ( X'  +  X'  )■’  3r‘  (Jf  -  mJ 


and 


(jc  -  fi^ y  s=ix-nyv  A-'^i  A'^y'^I'-'y  A'-^  )  (  A’^ A'-^  )  A-'^ V'a 

'-  y  '^v  — I,  y  ^  —y—y  —y  —y—^  —y—y  —y  “Y”-',  “Y~r  ~y  ~y  ^ 

■*7  ’’t 

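Thus, in the original (pretransformed) space the quadratic term (20) is

$$Q(x, \mu_{s_\gamma}) = (x - \mu_w)' \{\Sigma_w^{-1} - \Sigma_w^{-1} (\Sigma_w^{-1} + \Sigma_{s_\gamma}^{-1})^{-1} \Sigma_w^{-1}\} (x - \mu_w) + \mu_{s_\gamma}' \{\Sigma_{s_\gamma}^{-1} - \Sigma_{s_\gamma}^{-1} (\Sigma_w^{-1} + \Sigma_{s_\gamma}^{-1})^{-1} \Sigma_{s_\gamma}^{-1}\} \mu_{s_\gamma} - 2 (x - \mu_w)' \Sigma_w^{-1} (\Sigma_w^{-1} + \Sigma_{s_\gamma}^{-1})^{-1} \Sigma_{s_\gamma}^{-1} \mu_{s_\gamma}$$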

or,  equivalently, 
Qix,u  ) 


X 


^-1  /  Tjf-l  +  ^-1  Y-1  ^-1 
- w  ' w  — s  '  — s 

_ I _ ^  u 

■^-1  -  -^-1  r  -^-1  ^  -^-1  S-i  ^“1  ^ 

- W  - W  w  — '  w 


/^-i  _  -^-i  (  ^-1  +  Tjr-1  )-i  Ijr-1  \ 

( - w  - W  ^ - w  - '  - w  j 


x-K 


^“1  _  f 

— W  W  ^ W  S  - w 

7 

/  ^-1(^-1  +2'*  y^x^f 

”  *  ***  **'  ^T  / 

— — H.  —X,  — J,  ^-1  .Ijr-l  (^-1  +^-l)-l^-l 


f^s: 


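The intermediate statistic $Q(x, \mu_{s_\gamma})$ measures the squared distance of the decorrelated observation vector $x$ to the vector sum of the sensor noise mean $\mu_w$ and the scaled signal vector mean $\mu_{s_\gamma}$, to which a bias term quadratic in $\mu_{s_\gamma}$ is added. In practice, the bias is merged with the threshold. Furthermore, simple algebra shows that $\Sigma_w^{-1} - \Sigma_w^{-1} (\Sigma_w^{-1} + \Sigma_{s_\gamma}^{-1})^{-1} \Sigma_w^{-1} = (\Sigma_w + \Sigma_{s_\gamma})^{-1}$ and $\Sigma_w^{-1} (\Sigma_w^{-1} + \Sigma_{s_\gamma}^{-1})^{-1} \Sigma_{s_\gamma}^{-1} = (\Sigma_{s_\gamma} + \Sigma_w)^{-1}$, leading to a sufficient test statistic

$$\mathcal{L}(x) = Q(x, \mu_{s_b}) - Q(x, \mu_{s_t}) \underset{H_0}{\overset{H_1}{\gtrless}} \tau \qquad (21)$$

where

$$Q(x, \mu_{s_\gamma}) = (x - \mu_w - \mu_{s_\gamma})' (\Sigma_w + \Sigma_{s_\gamma})^{-1} (x - \mu_w - \mu_{s_\gamma}). \qquad (22)$$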
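The "simple algebra" behind (22) can be spot-checked numerically. The sketch below assumes the notation $\Sigma_w$ for the noise covariance and $\Sigma_s$ for the signal covariance (the original symbols are not fully legible), and takes the expanded three-term quadratic form to be the one obtained by integrating the signal field out of $Q(x,s)$; the matrices and vectors are random and purely hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)
N = 3                                        # hypothetical number of bands


def random_spd(n):
    """Random symmetric positive definite matrix (stand-in for a covariance)."""
    a = rng.standard_normal((n, n))
    return a @ a.T + n * np.eye(n)


cov_w, cov_s = random_spd(N), random_spd(N)
mu_w = rng.standard_normal(N)
mu_s = rng.standard_normal(N)
x = rng.standard_normal(N)

inv = np.linalg.inv
Wi, Si = inv(cov_w), inv(cov_s)
A_inv = inv(Wi + Si)                         # (Sigma_w^-1 + Sigma_s^-1)^-1

# Expanded three-term form in the original space.
d = x - mu_w
q_long = (d @ (Wi - Wi @ A_inv @ Wi) @ d
          + mu_s @ (Si - Si @ A_inv @ Si) @ mu_s
          - 2.0 * d @ Wi @ A_inv @ Si @ mu_s)

# Compact form (22).
r = x - mu_w - mu_s
q_short = r @ inv(cov_w + cov_s) @ r

print(q_long, q_short)
```

Both matrix identities quoted before (21) hold for any pair of positive definite matrices, so the agreement does not depend on the particular random draw.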
Observe the simplicity of the quadratic term (22) as compared to that in (20). The primary reason for the intermediate statistic $Q(\tilde x_\gamma, \tilde\mu_{s_\gamma})$ is to orthogonally transform $x$ (rotation and scaling) to $\tilde x_\gamma$ by the linear operator $U_\gamma$, $\gamma = b, t$. This renders the components of the signal field $\tilde s_\gamma$, which is a temperature-embedded emissivity vector, statistically independent of each other. This permits $\tilde s_\gamma$ to be integrated with respect to its (a posteriori) multivariate pdf, yielding $Q(\tilde x_\gamma, \tilde\mu_{s_\gamma})$, which is subsequently transformed to $Q(x, \mu_{s_\gamma})$ back in the original space. The detector $\mathcal{L}(x)$ in (21) operates as follows: each intermediate statistic $Q(x, \mu_{s_\gamma})$ prewhitens the zero-mean observation vector $x - \mu_w - \mu_{s_\gamma}$ and then squares it. If $Q(x, \mu_{s_b})$ exceeds $Q(x, \mu_{s_t})$ by more than the threshold $\tau$, then $H_1$ (target present) is selected; otherwise $H_0$ (target absent) is decided. The detector scheme is shown in Fig. 1.
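The decision rule in (21) and (22) is short enough to sketch directly. This is a minimal illustration, not the report's implementation: the function names `q_stat` and `detect`, the two-band means and covariances, and the default threshold of zero are all hypothetical.

```python
import numpy as np


def q_stat(x, mu_w, mu_s, cov_w, cov_s):
    """Quadratic term (22): whitened squared distance of x from mu_w + mu_s."""
    r = x - mu_w - mu_s
    return float(r @ np.linalg.solve(cov_w + cov_s, r))


def detect(x, mu_w, mu_sb, mu_st, cov_w, cov_sb, cov_st, tau=0.0):
    """Test statistic (21); returns (statistic, True if target is declared)."""
    stat = (q_stat(x, mu_w, mu_sb, cov_w, cov_sb)
            - q_stat(x, mu_w, mu_st, cov_w, cov_st))
    return stat, bool(stat > tau)


# Hypothetical two-band scene statistics.
mu_w = np.zeros(2)
mu_sb = np.array([0.4, 0.6])                 # background signal mean
mu_st = np.array([0.9, 0.2])                 # target signal mean
cov_w = 0.05 * np.eye(2)
cov_sb = np.array([[0.10, 0.02], [0.02, 0.08]])
cov_st = np.array([[0.04, 0.00], [0.00, 0.03]])

# A pixel sitting exactly at the target mean, then one at the background mean.
stat_t, is_target_t = detect(mu_w + mu_st, mu_w, mu_sb, mu_st, cov_w, cov_sb, cov_st)
stat_b, is_target_b = detect(mu_w + mu_sb, mu_w, mu_sb, mu_st, cov_w, cov_sb, cov_st)
print(stat_t, is_target_t)
print(stat_b, is_target_b)
```

In practice the threshold would be set from the desired false alarm rate rather than fixed at zero, and the rule would be applied pixel by pixel.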


IV.  SUBCLASSES 

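The generality of the optimal IR detector in the form of (21) will become clear when the sensor-noise field $w$ and/or the emissivity field$^1$ $e_\gamma$, $\gamma = b, t$, are subject to further regularity conditions:
a)  Constant  emissivity  detector

Consider a constant emissivity $e_\gamma$, which implies that $\Sigma_{s_\gamma}$ is null. Eq. (22) becomes

$$Q_c(x, \mu_{s_\gamma}) = (x - \mu_w - \mu_{s_\gamma})' \Sigma_w^{-1} (x - \mu_w - \mu_{s_\gamma}). \qquad (23)$$

The resulting test statistic

$$\mathcal{L}_c(x) = (x - \mu_w - \mu_{s_b})' \Sigma_w^{-1} (x - \mu_w - \mu_{s_b}) - (x - \mu_w - \mu_{s_t})' \Sigma_w^{-1} (x - \mu_w - \mu_{s_t}) \underset{H_0}{\overset{H_1}{\gtrless}} \tau_c \qquad (24)$$

is identical to the one given in the previous work [15].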

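For insight into $\mathcal{L}_c(x)$, repeat the orthogonalization procedure in Section III to decorrelate the random field $x - \mu_w - \mu_{s_\gamma}$, $\gamma = b, t$: introduce a unitary matrix$^2$ $V$ such that $V' \Sigma_w^{-1} V = \Lambda^{-1}$, and apply a linear operator $U = V \Lambda^{-1/2}$ to transform $x$, $\mu_w$, $\mu_{s_\gamma}$ to $\tilde x = U' x = \Lambda^{-1/2} V' x$, $\tilde\mu_w = \Lambda^{-1/2} V' \mu_w$, $\tilde\mu_{s_\gamma} = \Lambda^{-1/2} V' \mu_{s_\gamma}$, respectively. In the transformed space $Q_c(x, \mu_{s_\gamma})$ is

$$Q_c(x, \mu_{s_\gamma}) = (x - \mu_w - \mu_{s_\gamma})' V \Lambda^{-1/2} \Lambda^{-1/2} V' (x - \mu_w - \mu_{s_\gamma}) = (\tilde x - \tilde\mu_w - \tilde\mu_{s_\gamma})' (\tilde x - \tilde\mu_w - \tilde\mu_{s_\gamma}) = \sum_{i=1}^{N} (\tilde x_i - \tilde\mu_{w,i} - \tilde\mu_{s_\gamma,i})^2. \qquad (25)$$

$^1$ Because of the relation $\{s_{\gamma,i} = e_{\gamma,i} L_i(T_\gamma)\}$ and condition i), the terms emissivity and signal (radiance energy) are used interchangeably.

$^2$ Since the signal field is no longer random, only one set of $V$ is required.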

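The transformation associated with $U = V \Lambda^{-1/2}$ being a linear operation, the mean and the covariance of $\tilde w$ are $E(\tilde w) = E(\Lambda^{-1/2} V' w) = \tilde\mu_w$ and $\mathrm{cov}(\tilde w) = \mathrm{cov}(\Lambda^{-1/2} V' w) = \Lambda^{-1/2} V'\, \mathrm{cov}(w)\, V \Lambda^{-1/2} = I$, respectively.

Based on the signal model (5) and the general Gaussian assumptions i)-iv), under hypothesis $H_0$, the mean and the covariance of $\tilde x - \tilde\mu_w - \tilde\mu_{s_\gamma}$ are

$$E(\tilde x - \tilde\mu_w - \tilde\mu_{s_\gamma} \mid H_0) = E(\tilde\mu_{s_b} + \tilde w - \tilde\mu_w - \tilde\mu_{s_\gamma}) = \tilde\mu_{s_b} - \tilde\mu_{s_\gamma}, \qquad (26a)$$

$$\mathrm{cov}(\tilde x - \tilde\mu_w - \tilde\mu_{s_\gamma} \mid H_0) = \mathrm{cov}(\tilde\mu_{s_b} + \tilde w - \tilde\mu_w - \tilde\mu_{s_\gamma}) = \mathrm{cov}(\tilde w) = I. \qquad (26b)$$

Similarly, under hypothesis $H_1$, we have

$$E(\tilde x - \tilde\mu_w - \tilde\mu_{s_\gamma} \mid H_1) = \tilde\mu_{s_t} - \tilde\mu_{s_\gamma}, \qquad (26c)$$

$$\mathrm{cov}(\tilde x - \tilde\mu_w - \tilde\mu_{s_\gamma} \mid H_1) = I. \qquad (26d)$$

Eqs. (26b,d) are due to the shift-invariance of covariance.

Evidently, under hypothesis $H_j$ each intermediate statistic (23) is a noncentral chi-square variable with $N$ d.f. (degrees of freedom) and the noncentrality parameter $\delta^2(\gamma', \gamma) = \sum_{i=1}^{N} (\tilde\mu_{s_{\gamma'},i} - \tilde\mu_{s_\gamma,i})^2$, where $\gamma'$ denotes the true class ($b$ under $H_0$, $t$ under $H_1$),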

i.e.. 


Qc{x,ii,  \H.)  = 


H:=H^ 


■^r 


(27) 


Divide  (24)  by  j2c(  )  to  arrive  at  the  test  statistics 


1  -.’2 


ig  (x)  =  =  N  ^  ^  ^ 

Gc( 


(28) 


Under  hypothesis  H^,  the  statistics  ^^^(^  noncentral  F-distribution  with  N  and  N  d.f. 

and  noncentrality  parameter  (/i^.,  -  /^,;,  )^-  Th^t  is,  given  any  significance  level  a  of 

the  test  ^c(x),  the  threshold  t  can  be  found  from  [16] 
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cc  =  Prob  'I 


'TjXN 


>  T 


^  =1  -Prob\ 


•r  Xat 


=  1 


Pro6  I  ^  I 


which  leads  to 


Also,  the  power  of  SP.^{x)  under  hypothesis  is 

&=Prob\ 


(29) 


^XAr.JCMv/*^)  1 

>  T= - 


7;Xa? 


Qt;Ar^;  ) 


(30) 


Therefore,  for  a  specified  significance  of  the  test,  one  can  refer  to  a  F'  -table  to  determine  the 
threshold  t,  r  and  represents  the  upper  7  point  of  a  F’  random  variable.  The 

power $\beta$ of the test is then evaluated using (30) (fixed $\tau$). By the coincidence of the statistic $\mathcal{L}_c(x)$ with the $F'$ statistic, robustness is assured [16],[17] for small deviations from the Gaussian distribution. In particular, based on the robustness of the constant-emissivity detector, we may infer loosely that the new detector will retain a high probability of detection for populations deviating from normal. We remark that in the case wherein the components of the random noise field are pairwise statistically independent, $\Sigma_w$ is a scaled identity matrix (equal variance over all color bands), and $Q_c(\tilde x, \tilde\mu_{s_\gamma})$ and $Q_c(x, \mu_{s_\gamma})$ coincide, implying that the orthogonal transformation is not needed.


b)  MAP  emissivity  detector

In the presence of the random emissivity field, the pdf of $x$ given $H_j$, required in the implementation of the Neyman-Pearson detector (6), is evaluated from the pdf of $x$ given $H_j$ and $s_\gamma$, integrated over the distribution of $s_\gamma$. This accounts for the multi-dimensional integration and is, generally speaking, formidable to implement in terms of computation time.$^3$ One may, however, circumvent this difficulty by replacing the vector $s_\gamma$ by a new vector $\hat s_\gamma$ that is free from the distribution of $s_\gamma$. Intuitively, one prefers to settle on an $\hat s_\gamma$ that maximizes the conditional a posteriori pdf, i.e., replacing $f(x \mid H_j)$ by $\max_{s_\gamma} f(x \mid H_j, s_\gamma)$. In the general normal distribution

3 

The  complexity  can  be  appreciated  by,  trying  to  use  a  digital  computer,  approximating  integration  over  a  color  band 

by  10^  subintervals,  then  3  color  bands  required  10’  such  actions.  However,  the  complexity  can  be  alleviated  by 
forcing  the  independence  of  emissivity,  probably  not  valid  in  reality,  over  the  spectral  bands. 
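setting (conditions i)-iv) in Section III), as conveyed in (9),(10), since $f(x \mid H_j, s_\gamma) = \left((2\pi)^{2N} |\Sigma_w|\,|\Sigma_{s_\gamma}|\right)^{-1/2} \exp\{-\tfrac12 Q(x, s_\gamma)\}$, a particular $s_\gamma$ that maximizes $f(x \mid H_j, s_\gamma)$ also minimizes $Q(x, s_\gamma)$. Therefore, taking the derivative of $Q(x, s_\gamma)$ defined in (10) with respect to $s_\gamma$, using the formulas $(d/dA)\,A'B = B$ and $(d/dC)\,C'DC = 2DC$, yields $(d/ds_\gamma)\, Q(x, s_\gamma) = 2(\Sigma_w^{-1} + \Sigma_{s_\gamma}^{-1}) s_\gamma - 2\Sigma_w^{-1}(x - \mu_w) - 2\Sigma_{s_\gamma}^{-1} \mu_{s_\gamma}$. Setting it to zero, the maximum a posteriori (MAP) emissivity vector is

$$\hat s_\gamma = (\Sigma_w^{-1} + \Sigma_{s_\gamma}^{-1})^{-1} \{\Sigma_w^{-1}(x - \mu_w) + \Sigma_{s_\gamma}^{-1} \mu_{s_\gamma}\}. \qquad (31)$$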

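Taking the expectation, it is obvious that $\hat s_\gamma$ is an unbiased estimate of $s_\gamma$. Noting that $(\partial^2/\partial s_\gamma^2)\, Q(x, s_\gamma) = 2(\Sigma_w^{-1} + \Sigma_{s_\gamma}^{-1})$ is positive definite, because both $\Sigma_w^{-1}$ and $\Sigma_{s_\gamma}^{-1}$ are positive definite, (31) is a necessary and sufficient condition for the minimization of the intermediate statistic $Q(x, s_\gamma)$ and, therefore, the maximization of $f(x \mid H_j, s_\gamma)$. The sufficient statistic for the generalized likelihood detector [10],[15], $\max_{s_t} f(x \mid H_1, s_t) / \max_{s_b} f(x \mid H_0, s_b)$, can be represented by

$$\mathcal{L}_M(x) = Q_M(x, \hat s_b) - Q_M(x, \hat s_t) \underset{H_0}{\overset{H_1}{\gtrless}} \tau \qquad (32)$$

where

$$Q_M(x, \hat s_\gamma) = (x - \mu_w - \hat s_\gamma)' \left(\mathrm{cov}^{-1}(x - \mu_w - \hat s_\gamma)\right) (x - \mu_w - \hat s_\gamma).$$

Upon using $\hat s_\gamma$ evaluated before, the intermediate statistic (33) is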


Q^i  )  =  [  ( 7  -  ( +  X'  )■*  )( M  J  -  ( P  v  M.,  J  COV  -‘( X  -  ) 


(ir-i 

^ ^ 


(33) 


//  -  (^-1  +  )-i  ^-1  \  cov'^x  -  p  -s  ) 

\  - W  I  w  7 


x(7-(3:;^‘  +  3:;' )■' 3;;* ) 


X-t^y, 


(3^1  +5r-i)-i5r-l 

—  '  — W  — “C 
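Making use of (31), the covariance of $x - \mu_w - \hat s_\gamma$ is

$$\mathrm{cov}(x - \mu_w - \hat s_\gamma) = \left(I - (\Sigma_w^{-1} + \Sigma_{s_\gamma}^{-1})^{-1} \Sigma_w^{-1}\right) \mathrm{cov}(x - \mu_w) \left(I - (\Sigma_w^{-1} + \Sigma_{s_\gamma}^{-1})^{-1} \Sigma_w^{-1}\right)'$$

and $\mathrm{cov}(x - \mu_w) = \mathrm{cov}(s_\gamma + w - \mu_w) = \Sigma_{s_\gamma} + \Sigma_w$. Therefore, the intermediate statistic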

Q>.ixJ)  = 


X  -  u 


^"1  +  '^*‘1  ')■!  -^”1 
— w  — Vi/  — 

_  ^-1  (  ^-1  +  ^-1  yi  ^-1  ^ 
—vv  '  —v 

^-1  (^-1  +  ^“1  \-l  ^-1 

- w/  - K/  - s  ' 

- 1 - 1 -  U 


+£  )-' 


(34) 


is identical to its counterpart (22). In short, the likelihood ratio detector and the MAP detector are the same under general Gaussian distributions. We remark that in general, because the MAP detector does not use all information associated with the distribution of the signal field, it is suboptimum as compared to the likelihood ratio detector.
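The equivalence of the MAP statistic and the quadratic term (22) in the Gaussian case can be spot-checked numerically. The sketch below assumes the MAP estimate in the form $\hat s = (\Sigma_w^{-1} + \Sigma_s^{-1})^{-1}\{\Sigma_w^{-1}(x - \mu_w) + \Sigma_s^{-1}\mu_s\}$ and the residual covariance $B(\Sigma_w + \Sigma_s)B'$ with $B = I - (\Sigma_w^{-1} + \Sigma_s^{-1})^{-1}\Sigma_w^{-1}$; the symbols and the random test data are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(3)
N = 3                                        # hypothetical number of bands


def random_spd(n):
    """Random symmetric positive definite matrix (stand-in for a covariance)."""
    a = rng.standard_normal((n, n))
    return a @ a.T + n * np.eye(n)


cov_w, cov_s = random_spd(N), random_spd(N)
mu_w = rng.standard_normal(N)
mu_s = rng.standard_normal(N)
x = rng.standard_normal(N)

inv = np.linalg.inv
Wi, Si = inv(cov_w), inv(cov_s)
A_inv = inv(Wi + Si)

# MAP estimate of the signal vector, eq. (31).
s_hat = A_inv @ (Wi @ (x - mu_w) + Si @ mu_s)

# Gradient of Q(x, s) at s_hat should vanish.
grad = 2.0 * (Wi + Si) @ s_hat - 2.0 * Wi @ (x - mu_w) - 2.0 * Si @ mu_s

# Covariance of the residual x - mu_w - s_hat, then the MAP quadratic term.
B = np.eye(N) - A_inv @ Wi
cov_res = B @ (cov_w + cov_s) @ B.T
res = x - mu_w - s_hat
q_map = res @ np.linalg.solve(cov_res, res)

# Likelihood ratio quadratic term (22).
r = x - mu_w - mu_s
q_lr = r @ np.linalg.solve(cov_w + cov_s, r)

print(q_map, q_lr)
```

The agreement reflects the fact that the residual is an invertible linear map of $x - \mu_w - \mu_s$, so whitening it gives the same quadratic form.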


V.  RESULTS  OF  EVALUATION 

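The detector $\mathcal{L}(x)$ in the form of (21) takes into consideration a random emissivity field, while $\mathcal{L}_c(x)$ in the form of (24) is designed on the assumption of a constant emissivity vector. It is then of interest to compare their performance in terms of power for the same type-I error. For ease of analysis, only a one-dimensional observation vector is assumed. This implies $\Sigma_w = \sigma_w^2$, $\Sigma_{s_\gamma} = \sigma_{s_\gamma}^2$, and the orthogonal transformation is not needed. The detector $\mathcal{L}(x)$ in (21) is

$$\frac{(x - \mu_w - \mu_{s_b})^2}{\sigma_w^2 + \sigma_{s_b}^2} - \frac{(x - \mu_w - \mu_{s_t})^2}{\sigma_w^2 + \sigma_{s_t}^2} \underset{H_0}{\overset{H_1}{\gtrless}} \tau.$$

Multiplying both sides by $\sigma_w^2 + \sigma_{s_b}^2$ yields

$$\mathcal{L}(x) = (x - \mu_w - \mu_{s_b})^2 - r\,(x - \mu_w - \mu_{s_t})^2 \underset{H_0}{\overset{H_1}{\gtrless}} \tau, \qquad (35)$$

where $r$, defined as $r = (\sigma_w^2 + \sigma_{s_b}^2)/(\sigma_w^2 + \sigma_{s_t}^2)$, is a measure of the ratio of the noisy background power to the desirable noisy target power. Similarly, the constant emissivity detector (24) is, after dividing by $\sigma_w^{-2}$,

$$\mathcal{L}_c(x) = (x - \mu_w - \mu_{s_b})^2 - (x - \mu_w - \mu_{s_t})^2 \underset{H_0}{\overset{H_1}{\gtrless}} \tau_c. \qquad (36)$$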

Under  hypothesis  H^,  is  while  {x-n^-n^Y  is  (at  o^,)  Xu-i> 

8=11^  .  Using  the  identity  Xnls  =  (?  +  +  X^i ,  where  ?  is  independent  of  xl-i  and  is  iV(0, 1)  [16], 

the  false  alarm  encountered  by  ig(x)  and  igc.(x)  are 

ct=Prob{ial^al)U^-{ial^al)y(al^a:)}{V-8f  >  r}  (37a) 

oCc  =  Prob  I  {al  +  a])  W  -  (al  +  al){V  -  8f  >  r,, }  (^^^) 

respectively,  and  U,  V  —  A^(0,1)  and  is  independent  of  each  other.  For  comparison,  the  conditioner  = 
is  imposed,  and  can  be  satisfied  through  adjustment  of  the  thresholds  t,  t^. 

By  the  same  token,  under  hypothesis  (x  -  Y  is  (a^  +  a])xi,i, ,  while(x  -  \i,  Y 


is  (al  +  al)x]  ■  The  power  of  detectors  ig(jc) ,  X^(x)  are,  respectively. 


/3  =  Prob  { (al  +  a:)(U  +  8Y  -  (al  +  <)  V’-  >  t  }  (37c) 

=  Prob  { (al  +  a])(U  +  8Y  -  (al  +  }.  (37d) 

The lemma below is convenient for evaluating the $\alpha$'s and $\beta$'s.

Lemma  I.  Suppose  U,  V  ~  N(0,l)  and  are  statistically  independent  of  each  other.  Suppose  also 
scalar  constants  and,  in  particular,  ^  is  positive  definite.  Then  the  rv  Y, 


Y  =  ^](U  +  5,)2  -^l(V  +  8,Y ,  has  the  pdf 
00 

I 


iiiax(Oty) 


i.y)  8ir|,$,Vx(;c-y) 
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2 

exp 

-  V2 

s 

It  -J 

+  exp 

-  V2 

r'^ 

X 


-  ^ 

2 

2 

^  exp 

-  V2 

(x-y)'^  _5 

^2 

+  exp 

“  V2 

(x-y)'^  ,5 
^2 

r  dx. 


Setting  the  rv  (ff^  +  ff^)  C7-  -  { (ff^  +  alf^al  +  (t^)  }  ( V  -  8Y  equal  to  Y  in  Lemma  I,  the  distribution 
of  the  former  rv  is  identical  to  that  of  the  latter  with  the  parameters: 
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6,=0,  Sj-  8,  ^^=al  +  al,  +  crj) .  Therefore,  given  a,  the  threshold  t  is  selected  by 

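$$\alpha = \int_{\tau}^{\infty} \int_{\max(0,y)}^{\infty} \frac{1}{4\pi \xi_1 \xi_2 \sqrt{x(x-y)}}
\exp\left(-\frac{x}{2\xi_1^2}\right)
\left\{ \exp\left[-\tfrac{1}{2}\left(\frac{\sqrt{x-y}}{\xi_2} - \delta\right)^2\right]
      + \exp\left[-\tfrac{1}{2}\left(\frac{\sqrt{x-y}}{\xi_2} + \delta\right)^2\right] \right\} dx\, dy . \tag{38a}$$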


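Similarly, given $\alpha_c$, the threshold $\tau_c$ is selected from, with $\xi_1^2 = \xi_2^2$ equal to the
common coefficient in (37b),

$$\alpha_c = \int_{\tau_c}^{\infty} \int_{\max(0,y)}^{\infty} \frac{1}{4\pi \xi_1 \xi_2 \sqrt{x(x-y)}}
\exp\left(-\frac{x}{2\xi_1^2}\right)
\left\{ \exp\left[-\tfrac{1}{2}\left(\frac{\sqrt{x-y}}{\xi_2} - \delta\right)^2\right]
      + \exp\left[-\tfrac{1}{2}\left(\frac{\sqrt{x-y}}{\xi_2} + \delta\right)^2\right] \right\} dx\, dy . \tag{38b}$$

The powers resulting from $\mathcal{L}(x)$, $\mathcal{L}_c(x)$ are then found by ($\tau$, $\tau_c$ fixed)

$$\beta = \int_{\tau}^{\infty} \int_{\max(0,y)}^{\infty} \frac{1}{4\pi \xi_1 \xi_2 \sqrt{x(x-y)}}
\exp\left(-\frac{x-y}{2\xi_2^2}\right)
\left\{ \exp\left[-\tfrac{1}{2}\left(\frac{\sqrt{x}}{\xi_1} - \delta\right)^2\right]
      + \exp\left[-\tfrac{1}{2}\left(\frac{\sqrt{x}}{\xi_1} + \delta\right)^2\right] \right\} dx\, dy \tag{38c}$$

with $\xi_1^2$ and $\xi_2^2$ the coefficients of $(U+\delta)^2$ and $V^2$ in (37c), and

$$\beta_c = \int_{\tau_c}^{\infty} \int_{\max(0,y)}^{\infty} \frac{1}{4\pi \xi_1 \xi_2 \sqrt{x(x-y)}}
\exp\left(-\frac{x-y}{2\xi_2^2}\right)
\left\{ \exp\left[-\tfrac{1}{2}\left(\frac{\sqrt{x}}{\xi_1} - \delta\right)^2\right]
      + \exp\left[-\tfrac{1}{2}\left(\frac{\sqrt{x}}{\xi_1} + \delta\right)^2\right] \right\} dx\, dy \tag{38d}$$

with $\xi_1^2 = \xi_2^2$ equal to the common coefficient in (37d).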

Since  eqs.(38)  cannot  be  expressed  in  a  closed  form,  we  resort  to  numerical  computation.  In 
particular,  we  consider  the  detector  operating  characteristics  when  =  0.5 ,  =  0.4 ,  n^=0,  al=  0.5 , 

ol  =  0.3 ,  and  varies  from  0.05  to  0.30 .  Results  are  depicted  in  Fig.2.  The  superiority  of  .^(x)  over 
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S?^(jc),  even  for  a  single  color-band  environment,  is  self-evident. 


VI.  CONCLUSIONS 

We analyzed a detector on the basis of a model in which each signal component consists of the
radiant thermal energy contributed by the emissivity and the blackbody radiation reference $L_i(T)$, at a
particular bandwidth in the electromagnetic spectrum, of either the target or the background. Since $L_i(T)$
is a non-linear function of the unknown temperature, a low-order power series is employed for its
approximation, using the integral square error (ISE) as a fidelity criterion and recognizing that the
temperature range is usually known. An obvious advantage of this approach is that, because the
coefficients of the power series remain constant, the approximation of $L_i(T)$ can be easily computed for
any varying temperature, provided it is within the temperature range for which the ISE was designed.
More importantly, a reasonably good estimate of $L_i(T)$ may be obtained even if there is a mismatch
between the actual temperature and its predicted counterpart. A less obvious advantage is that the power
series approach permits evaluating statistics (e.g., mean and variance) of the underlying signal when the
temperature is also random. However, the ISE method is pessimistic because it assumes that the variable
temperature is uniformly distributed throughout the temperature range. If the probability distribution of
temperature is known, a more appropriate criterion, i.e., the mean square error, should be applied
instead. Note also that no approximation is required if the temperature is known.

A  sufficient  statistic  i£(jc)  in  the  form  of  (21)  consisting  of  two  intermediate  statistics 
Q(x,n^ ),  T=b,  t,  when  the  underlying  sensor  noise  and  spectral  emissivities  are  normally  distributed, 

is  derived.  The  new  class  of  detectors  exploits  the  spectral  energy  differences  that  exist  between  a  desired 
target  signature  and  the  background  over  the  wavelengths  that  make  up  the  spectral  bandwidth.  To 
understand  the  mechanism  of  the  new  detector  iS(x)  we  proceed  as  follows:  suppose  is  true,  then 
E  {  Q(  x,n^^  )\Hg}  is  zero  and  E  {  QC  x,ii^  )  |  }  is  a  negative  quantity,  implying  thatE  {^.(x)  \  H^} 

is  a  negative  quantity,  with  the  resulting  decision  that  is  true  (which  is  correct);  similarly,  if 
E{^.{x)  \  H^}  =E{Qi  x,ii^^  )\H^}  -E{Qix,ii^)\H^}  =  +ve,  again  the  correct  decision  that  is 

true is reached. Therefore, the new detector yields a powerful design for enhancing the target,
suppressing the clutter background, and whitening the multispectral data prior to thresholding, on a
pixel-by-pixel basis.

It is then shown that under more regular condition(s), the new detector reduces to other classes
of detectors, including the constant emissivity detector and the MAP detector. This suggests that the new
class of detectors performs at least as well as other classes of detectors and is better than the latter when
the more restrictive condition(s) are not encountered in a given application.

The performance of the proposed detector is demonstrated by extensive numerical computations.
Although only single-pixel targets are considered in this research, with slight modification the same
detector can be applied to targets filling less than one pixel. On the other hand, if a target occupies more
than one pixel, well-developed spatial filtering techniques [18] may provide additional information to allow
discrimination between targets and backgrounds if their spatial frequency characteristics are different.

Finally, we remark that since the proposed detector exploits differences in the spectral energy
content of target and background radiation, it may be more effective against targets in camouflage
or using false coloring than detectors based on emissivity alone.

Fig. 1  Optimum IR detector

Fig. 2  Performance comparison of random emissivity detector versus constant emissivity detector
(vertical axis: detection power)

DETECTION PERFORMANCE OF OVER RESOLVED TARGETS WITH NON-UNIFORM
ENERGY DISTRIBUTION IN THE RANGE CELLS

Ajit K. Choudhury
Associate Professor
Department of Electrical Engineering
Howard University

Abstract

Many formulas used in computing radar detection performance, such as those involving incomplete
Toronto functions and modified Bessel functions, behave very poorly numerically, which leads to
errors in the results. Also, in calculating radar detection performance, it is generally assumed
that the energy is uniformly distributed over the resolution cells. In practice, however, the cells
may have different energies, and the computation of detection performance must be modified
accordingly.

In this paper, we develop numerical methods that overcome these numerical difficulties and
thus obtain much more accurate results. We also develop a realistic model of the energy
distribution and find that the probability of detection generally tends to increase when the
energy is not uniformly distributed. This behavior is interpreted through the shape of the curves
corresponding to uniformly distributed energy.

Introduction 

The formulas for computing radar detection performance can be found in the literature,
e.g., [1]-[2]. A serious difficulty in radar detection performance calculation is that many of the
formulas involved behave very poorly numerically and thus lead to significant errors in the
results. Typical sources of numerical error are truncation of numbers, subtraction of nearly
equal numbers, truncation of series, numerical integration, overflow, underflow, and open-range
integration. The overflows and underflows are mainly caused by two special functions:
incomplete Toronto functions and modified Bessel functions.

The target models we have studied include:

1)  single pulse, constant amplitude scatterers,

2)  single pulse, Rayleigh scatterers,

3)  single pulse, dominant plus Rayleigh scatterers,

4)  multi-pulse, constant amplitude scatterers,

5)  multi-pulse, slow fluctuating targets, Rayleigh scatterers,

6)  multi-pulse, fast fluctuating targets, Rayleigh scatterers,

7)  multi-pulse, slow fluctuating targets, dominant plus Rayleigh scatterers,

8)  multi-pulse, fast fluctuating targets, dominant plus Rayleigh scatterers.

The target models with good numerical behavior include single-pulse Rayleigh scatterers
and single-pulse dominant plus Rayleigh scatterers. The target models with bad numerical
behavior include single-pulse constant amplitude scatterers and multi-pulse constant amplitude
scatterers. The target models with some numerical problems, especially when the number of
pulses is large and the false alarm probability approaches zero, include multi-pulse Rayleigh
scatterers (slow fluctuating or fast fluctuating targets) and multi-pulse dominant plus Rayleigh
scatterers (slow fluctuating or fast fluctuating targets).

Another problem associated with detection performance calculation is the energy model.
In existing work, it is generally assumed that the energy is uniformly distributed over the
cells. In practical applications, however, the cells may have different energies; i.e., the energy is
not uniformly distributed over the cells. In these cases, the actual probability of detection will
differ from that with uniformly distributed energy, and most of the formulas involved in computing
the probability must be modified.

A difficulty in studying the effects of the energy distribution is the large number of possible
distribution patterns of the energy. To overcome this difficulty, we have developed a simple
energy distribution model. Based on the numerical methods and the new energy distribution
model, we have studied the effects of the energy distribution on detection performance for all
the target models mentioned above, for various pulse numbers and cell numbers. A large number
of figures were plotted. Because of space limitations, however, we use only multi-pulse constant
amplitude scatterers as the example for both the numerical problems and the energy model. For
more results, see Ref. [2].

Formulas for Uniformly Distributed Energy

The formulas involved in computing radar detection performance can be found in
the literature. The following formulas are taken from Ref. [1]. It is assumed that the
energy is uniformly distributed over the resolution cells.

For multi-pulse cases, the relationship between the false alarm probability, $P_{fa}$, and the
threshold, $Y_b$, is given by

$$P_{fa} = 1 - \left[ I\left(\frac{Y_b}{\sqrt{N}},\, N-1\right) \right]^{M} \tag{1}$$

where M is the total number of resolution cells and N is the number of pulses. $I(u, s)$ is the
incomplete Gamma function expressed by

$$I(u, s) = \int_0^{u\sqrt{s+1}} \frac{e^{-t}\, t^{s}}{s!}\, dt \tag{2}$$
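A minimal sketch of how Eqs. (1) and (2) might be inverted for the threshold, assuming integer N and Pearson's form of the incomplete Gamma function, so that I(Y_b/sqrt(N), N-1) reduces to 1 - exp(-Y_b) * sum_{k<N} Y_b^k / k!. The function names and the bisection search are illustrative assumptions, not the authors' code. The sum is formed in the log domain and the M-th power is taken through `log1p`/`expm1`, so that very small false alarm probabilities are not lost to cancellation.

```python
import math


def cell_false_alarm(yb, n_pulses):
    """1 - I(yb/sqrt(N), N-1) = exp(-yb) * sum_{k=0}^{N-1} yb^k / k!  (integer N)."""
    if yb <= 0.0:
        return 1.0
    logs = [k * math.log(yb) - math.lgamma(k + 1) - yb for k in range(n_pulses)]
    peak = max(logs)
    return min(1.0, math.exp(peak) * sum(math.exp(v - peak) for v in logs))


def total_false_alarm(yb, n_pulses, n_cells):
    """P_fa = 1 - (1 - q)^M, evaluated without cancellation for small q."""
    q = cell_false_alarm(yb, n_pulses)
    if q >= 1.0:
        return 1.0
    return -math.expm1(n_cells * math.log1p(-q))


def threshold_for(pfa, n_pulses, n_cells):
    """Bisection for the threshold Y_b that gives the requested P_fa."""
    lo, hi = 0.0, 1.0
    while total_false_alarm(hi, n_pulses, n_cells) > pfa:
        hi *= 2.0                      # P_fa decreases as the threshold grows
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if total_false_alarm(mid, n_pulses, n_cells) > pfa:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

Under these assumptions, a single pulse with M = 10 and false alarm probabilities of 0.01 and 1.E-8 gives thresholds close to the values 6.903 and 20.72 quoted later in the paper for Figure 5, which lends some support to this reading of Eqs. (1) and (2).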

The probability of detection, $P_d$, is given by

$$P_d = 1 - (1 - P_{dm})^{M} \tag{3}$$

where $P_{dm}$ is the probability that a single cell crosses the threshold, and has different expressions
for different target models. For multi-pulse constant amplitude scatterers:

$$P_{dm} = 1 - T_{\sqrt{Y_b}}\left[\, 2N-1,\ N-1,\ \sqrt{E/(M N_0)} \,\right] \tag{4}$$

where N is the number of pulses, E is the total signal energy, $N_0$ is the total noise, and
$T_B[m, n, r]$ is the incomplete Toronto function, which has the following expression:

$$T_B(m, n, r) = 2\, r^{\,n-m+1}\, e^{-r^2} \int_0^{B} t^{\,m-n}\, e^{-t^2}\, B_n(2rt)\, dt \tag{5}$$

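and $B_n(z)$ is the nth order modified Bessel function, expressed by either

$$B_n(z) = \sum_{k=0}^{\infty} \frac{1}{k!\,(k+n)!} \left(\frac{z}{2}\right)^{n+2k} \tag{6a}$$

or

$$B_n(z) = \frac{(z/2)^{n}}{\Gamma(n+1/2)\,\Gamma(1/2)} \int_{-1}^{1} (1-t^2)^{\,n-1/2}\, e^{zt}\, dt \tag{6b}$$

where $\Gamma(n)$ is the Gamma function.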

The  Numerical  Difficulties 

Most numerical difficulties are caused by the incomplete Toronto function, the modified
Bessel function, and an open-range integral (for multi-pulse slow-fluctuating-target dominant plus
Rayleigh scatterers).

The incomplete Toronto function and the Bessel function generate a large number of
overflows and underflows. The open-range integral converges very slowly; therefore, in order
to obtain accurate results, we must use a large upper limit for the integral. However, when the
limit becomes large, underflows and overflows may occur.

In the incomplete Toronto function (see Eq. (5)), the following problems may appear.

1.  The factor $r^{\,n-m+1}$ may underflow when r is large and m is much greater than n.

2.  The factor $\exp(-r^2)$ may underflow when r is large.

3.  When m is much larger than n, the factor $t^{\,m-n}$ may underflow or overflow
when t is small or large, respectively.

4.  The factor $\exp(-t^2)$ may underflow when t is large.
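One way to sidestep all four problems at once is to assemble the logarithm of the whole integrand and exponentiate only the sum, so that no single factor is ever formed on its own. The sketch below does this with Simpson's rule, reading Eq. (5) as T_B(m, n, r) = 2 r^(n-m+1) exp(-r^2) * integral from 0 to B of t^(m-n) exp(-t^2) B_n(2rt) dt and using the series (6a) for B_n. That reading, the function names, the step count, the cutoff of 45 in the series, and the restriction to r > 0 and m > n are assumptions made for illustration; it is not the authors' implementation.

```python
import math


def log_bessel(n, z):
    """ln B_n(z) for z > 0, summing the series (6a) in the log domain."""
    log_half_z = math.log(0.5 * z)
    logs = []
    best = -math.inf
    k = 0
    while True:
        term = (n + 2 * k) * log_half_z - math.lgamma(k + 1) - math.lgamma(k + n + 1)
        logs.append(term)
        best = max(best, term)
        if term < best - 45.0:     # past the peak; later terms are negligible
            break
        k += 1
    return best + math.log(sum(math.exp(v - best) for v in logs))


def toronto(big_b, m, n, r, steps=1000):
    """T_B(m, n, r) by Simpson's rule; needs r > 0, m > n, and an even `steps`."""
    def integrand(t):
        if t == 0.0:
            return 0.0             # t^(m-n) vanishes at the origin when m > n
        log_f = (math.log(2.0) + (n - m + 1) * math.log(r) - r * r
                 + (m - n) * math.log(t) - t * t + log_bessel(n, 2.0 * r * t))
        return math.exp(log_f)     # only the combined exponent is exponentiated

    h = big_b / steps
    total = integrand(0.0) + integrand(big_b)
    for k in range(1, steps):
        total += (4.0 if k % 2 else 2.0) * integrand(k * h)
    return total * h / 3.0
```

Two checks are available under these assumptions. For m = 2N-1 and n = N-1 the function should approach 1 as B grows, even for r = 30, where the isolated factor exp(-r^2) underflows to zero in double precision. For very small r, 1 - T with B = sqrt(Y_b) should approach exp(-Y_b) * sum_{k<N} Y_b^k / k!, the per-cell false alarm probability.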

In the modified Bessel function, if the series form (Eq. (6a)) is used and z is not small,
we must sum many terms in order to obtain accurate results. Then the factors $(z/2)^{n+2k}$, $k!$,
and $(k+n)!$ may overflow. If the integral form (Eq. (6b)) is used and n is zero, there are
singular points at t = -1 and t = 1. These singular points seriously degrade the accuracy of the
results.

The numerical difficulties of the open-range integral are discussed in Ref. [2]. The other
sources of numerical error include truncation of series, truncation of numbers, and numerical
integration. Some series, such as that of the modified Bessel function, may decay slowly with the
index. If such a series is truncated too early, the results are not accurate; if it is not
truncated early, overflows or underflows may occur.

The Numerical Methods

It is relatively easy to reduce the errors caused by truncation of numbers and numerical
integration. We can reduce the errors caused by truncation of numbers by using double precision
variables, which are generally sufficient. We can reduce the errors caused by numerical
integration by dividing the integration intervals finely. Also, if double precision variables are
used, we find that the errors caused by subtraction of numbers are not serious.

To improve the behavior of the modified Bessel function, we use the integral form when
n is not zero and the series form when n is zero. This method greatly reduces the number of
overflows and underflows and avoids the singularity problems.

The remaining difficulties are overflows and underflows. To handle these problems, we
classify the overflows into the following patterns:

1.  $S \cdot t^{-n}$ when t and S are small and n is large. In this case the factor $t^{-n}$ overflows.
Generally, when a factor overflows, there should be another factor which is small;
otherwise the problem cannot be solved. In this pattern we can solve the problem
by using the equivalent form $t^{-n+\log_t S}$.

2.  $S \cdot r^{n}$ when r and n are large and S is small. In this case, the factor $r^{n}$ overflows. We
can solve the problem by using the equivalent form $r^{\,n+\log_r S}$.

3.  $S \cdot \sum_j t_j$ when some of the $t_j$ overflow and S is small. In this case, we can solve the
problem by using the equivalent form $\sum_j (S \cdot t_j)$.

Similarly, we can classify underflows into the following patterns:

1.  $B \cdot \exp(-t)$ when B and t are large. In this case, the factor $\exp(-t)$ underflows.
Generally, the computer will assign zero to underflowed variables. This
will not introduce errors if no factor is large. Therefore, we only need to take
care of the cases with a large factor, B. In this pattern, we can solve the problem by
using the equivalent form $\exp(-t + \ln B)$. If the original form is used, the value
of the expression is zero, but the equivalent form may give non-zero values.

2.  $B \cdot t^{n}$ when t is small and B and n are large. In this case, the factor $t^{n}$ underflows.
We can solve the problem by using the equivalent form $t^{\,n+\log_t B}$.

3.  $B \cdot r^{-n}$ when r, n, and B are all large. In this case, the factor $r^{-n}$ underflows, and we
can solve the problem by using the equivalent form $r^{-n+\log_r B}$.

4.  $B \cdot \sum_j t_j$ when some of the $t_j$ underflow and the sum is small. We can solve the
problem by using the equivalent form $\sum_j (B \cdot t_j)$. If the sum is not small, the
underflowed terms can be neglected without introducing errors.

By using these methods, we can significantly improve the accuracy of the results. The
methods eliminate all the overflows. Although some underflows still exist, they do not
introduce errors; e.g., all the factors are small, or a term in a series is negligible compared with
the other terms in the series.
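The overflow and underflow patterns above share one idea: fold the large (or small) companion factor into the exponent before anything is exponentiated. A minimal sketch of that idea, written with natural logarithms throughout, follows. The helper names and the choice to pass ln S or ln B rather than S or B are assumptions made for illustration, and the forms used are mathematically equivalent rewrites rather than necessarily the exact expressions used by the authors.

```python
import math


def scaled_exp(log_b, t):
    """B * exp(-t), evaluated as exp(-t + ln B)."""
    return math.exp(log_b - t)


def scaled_power(log_s, t, n):
    """S * t**n for t > 0 and any real n, evaluated as exp(ln S + n ln t)."""
    return math.exp(log_s + n * math.log(t))


def scaled_sum(log_s, log_terms):
    """S * sum_j t_j, evaluated as sum_j exp(ln S + ln t_j)."""
    return math.fsum(math.exp(log_s + lt) for lt in log_terms)
```

For instance, with B = 1e300 and t = 800 the direct product `1e300 * math.exp(-800.0)` evaluates to zero in double precision because `exp(-800)` underflows, whereas `scaled_exp(math.log(1e300), 800.0)` returns a small non-zero value. Likewise `scaled_power(math.log(1e-300), 1e-3, -120)` returns a finite result although the isolated factor `(1e-3) ** -120` is too large to represent.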

The Results for Uniformly-Distributed Energy

Using the numerical methods developed above, we plotted a large number of figures for
various target models and pulse numbers. The figures show the probability of detection vs. the
signal-to-noise ratio for different cell numbers. We compared those figures with those in the
literature and found that our results are much more accurate than those in the literature.

Figure 1 shows our result for the detection probability of 16-pulse constant amplitude
scatterers when the false alarm probability is 1.E-8. For other cases and the comparison of the
results, see Ref. [2].

Formulation of Non-Uniformly Distributed Energy

To develop the energy distribution model, we assume that the total number of resolution
cells, M, is given and that the energy of each cell falls into one of a preset number of energy
levels. The energy of the 2nd level is twice the energy of the 1st level, the energy of the 3rd
level is three times the energy of the 1st level, and so on.

Generally, the energy of most cells is in the middle levels, and the energy of only a few cells
is in the upper and lower levels. Therefore, we assume that the number of cells with a
specific energy is normally distributed over the energy levels. The number of cells at energy level
$E_i$ is given by

$$M_i = S \exp\left( -\frac{(i - \bar{E})^2}{2\sigma^2} \right) \qquad (i = 1, 2, \ldots, N_l) \tag{7}$$

where $N_l$ is the total number of energy levels. Since $M_i$ is generally not an integer at an energy
level, it has to be rounded. In Eq. (7), $\bar{E}$ and $\sigma$ are free parameters that control the shape of
the distribution, and S is selected so that the total number of cells equals the given number M.

Figure 2 shows an example with 8 energy levels.

We should note the difference between the energy distribution over the cells and the
distribution of the number of cells over the energy levels. When $\sigma$ is small, the cells are
concentrated near $\bar{E}$; i.e., the number of cells is unevenly distributed over the energy levels. In
this case, all the cells have almost the same energy, $\bar{E}$; i.e., the energy is almost uniformly
distributed over the cells.
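The rounding step in Eq. (7) can be sketched as follows, assuming the exponent is read as -(i - E_bar)^2 / (2 sigma^2) with E_bar expressed in units of the level index. The text does not say how S is found or what happens when no S makes the rounded counts sum exactly to M, so the bisection on S and the rule of assigning any leftover cells to the level nearest E_bar are assumptions of this sketch, as are the function and parameter names.

```python
import math


def cell_counts(total_cells, n_levels, mean_level, sigma):
    """Integer cell counts per energy level, following the shape of Eq. (7)."""
    weights = [math.exp(-((i - mean_level) ** 2) / (2.0 * sigma ** 2))
               for i in range(1, n_levels + 1)]

    def rounded(scale):
        # Round half up, so the rounded total never decreases as the scale grows.
        return [int(math.floor(scale * w + 0.5)) for w in weights]

    # Bisection for the largest scale S whose rounded total does not exceed M.
    lo, hi = 0.0, 1.0
    while sum(rounded(hi)) <= total_cells:
        hi *= 2.0
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if sum(rounded(mid)) <= total_cells:
            lo = mid
        else:
            hi = mid

    counts = rounded(lo)
    # Assumed tie-break: leftover cells go to the level with the largest weight.
    peak = max(range(n_levels), key=lambda k: weights[k])
    counts[peak] += total_cells - sum(counts)
    return counts
```

With 10 cells, 8 levels and a mean level of 5, this rule gives `[0, 0, 1, 2, 4, 2, 1, 0]` for sigma = 1.0 and `[1, 1, 1, 1, 3, 1, 1, 1]` for sigma = 3.0, which agrees with the corresponding rows of Table 1; that agreement does not establish that the authors used the same procedure.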

When the energy is not uniformly distributed over the cells, most of the formulas given
in the literature must be modified. The probability of detection is given by

$$P_d = 1 - \prod_{i=1}^{N_l} \left( 1 - P_{dm}^{(i)} \right)^{M_i} \tag{8}$$

where $M_i$ is the number of cells at the ith energy level. Corresponding to different target models,
the formulas to compute $P_{dm}^{(i)}$ are given in Ref. ( ). For multi-pulse constant amplitude
scatterers,

$$P_{dm}^{(i)} = 1 - T_{\sqrt{Y_b}}\left[\, 2N-1,\ N-1,\ \sqrt{E_i/(M_i N_0)} \,\right] \tag{9}$$

where N is the number of pulses.

The threshold, $Y_b$, is still related to the false alarm probability, $P_{fa}$, by Eq. (1). The total
signal energy in a given energy level, $E_i$, is related to the total signal energy, E, by

$$E = \sum_{i=1}^{N_l} E_i = \sum_{i=1}^{N_l} i\, E_0\, M_i \tag{10}$$

where $M_i$ is the number of cells in the ith energy level and $E_0$ is the base energy. Then

$$E_i = i\, E_0\, M_i = \frac{i\, E\, M_i}{\sum_{j=1}^{N_l} j\, M_j} \tag{11}$$

Results for Non-Uniformly Distributed Energy

Corresponding to different target models, numbers of cells, pulse numbers, and energy
levels, we have plotted a number of figures in Ref. [2] which show the probability of detection
vs. the ratio of total energy to noise. Only a few of them are shown in this paper.

Table 1 shows the distribution of the cells over the energy levels for different values of $\sigma$
when the total number of cells is 10, the total number of energy levels is 8, and $\bar{E}$ = 5. We can
see that when $\sigma$ is small, the cells are distributed over a few energy levels near $\bar{E}$, i.e., the cells
have nearly equal energy. This situation is similar to uniformly distributed energy over the cells.
When $\sigma$ = 0.1, all the cells are in the energy level $\bar{E}$; i.e., the energy is uniformly distributed
over the cells.


Table 1.  Distribution of the 10 Cells over 8 Energy Levels for Different Values of $\sigma$ when
$\bar{E}$ = 5

              Energy Level
$\sigma$     1    2    3    4    5    6    7    8
0.1          0    0    0    0   10    0    0    0
0.5          0    0    0    1    8    1    0    0
1.0          0    0    1    2    4    2    1    0
2.0          0    1    1    2    2    2    1    1
3.0          1    1    1    1    3    1    1    1
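As a small worked example of the energy bookkeeping in Eqs. (10) and (11), the sketch below splits a total signal energy among the levels for a given row of Table 1, reading the equations as E = sum_i i * E_0 * M_i and E_i = i * E_0 * M_i. The function name and the normalization of the total energy to 1 in the usage note are assumptions made for illustration.

```python
def level_energies(total_energy, counts):
    """Return the base energy E_0 and the per-level energies E_i = i * E_0 * M_i.

    `counts[i - 1]` is M_i, the number of cells at level i (levels start at 1).
    """
    weight = sum(i * m for i, m in enumerate(counts, start=1))
    base = total_energy / weight
    energies = [i * base * m for i, m in enumerate(counts, start=1)]
    return base, energies
```

For the sigma = 1.0 row of Table 1, `level_energies(1.0, [0, 0, 1, 2, 4, 2, 1, 0])` gives a base energy of 1/50 of the total, and the four cells at level 5 together hold 40 percent of the total energy.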

Figure 3 shows the comparison of detection performance between uniformly distributed
energy and non-uniformly distributed energy for 4-pulse constant amplitude scatterers when M = 2
and the false alarm probability is 1.E-8. When the energy is uniformly distributed over the cells,
both cells have the same energy; when the energy is non-uniformly distributed over the cells, the
energy of one of the cells is twice that of the other.

Figure 4 shows the comparison of detection performance between different values of $\sigma$
for 4-pulse constant amplitude scatterers when $N_l$ = 8, M = 10, $\bar{E}$ = 5, and the false alarm
probability is 1.E-8.

In the figures, the horizontal axis denotes the ratio of total signal energy to total noise
energy ($E/N_0$) in decibels, and the vertical axis denotes the probability of detection in percent.
$Y_b$ is the threshold, M is the total number of cells, "Level" is the total number of energy levels,
and "Mean" denotes $\bar{E}$.

According to our study of different target models, numbers of pulses, energy levels,
numbers of cells, average energies, $\sigma$ values, and false alarm probabilities, we found the
following phenomenon common to every target model:

When the false alarm probability is large (e.g., 0.01), the probability of detection is not
significantly affected by the energy distribution. As the false alarm probability becomes smaller,
the effects of the energy distribution become more significant.

In addition to this common phenomenon, we have also found that the effects of the energy
distribution depend strongly on the target model. Multi-pulse constant amplitude scatterers are
most seriously affected by the energy distribution; single-pulse constant amplitude scatterers are
second; then follow, in order, multi-pulse fast-fluctuating-target dominant plus Rayleigh
scatterers, multi-pulse fast-fluctuating-target Rayleigh scatterers, multi-pulse
slow-fluctuating-target dominant plus Rayleigh scatterers, multi-pulse slow-fluctuating-target
Rayleigh scatterers, single-pulse dominant plus Rayleigh scatterers, and finally single-pulse
Rayleigh scatterers. For constant amplitude scatterers (single-pulse or multi-pulse), the effects
of the energy distribution are similar whether the probability of detection is low or high.
However, for the other target models (Rayleigh scatterers and dominant plus Rayleigh
scatterers), the effect of the energy distribution is significant in some ranges of the probability
and less significant in others. Generally, non-uniformly distributed energy increases the
probability of detection, and the effects are more significant in the low-probability ranges. The
reason is that the threshold is hard to cross in that range, and non-uniformly distributed energy
significantly increases the chance of crossing the threshold for the cells with higher energy.

To interpret the effects of the energy distribution, let us define the "cell probability of
detection" as $P_{dm}$ when the energy is uniformly distributed. Then the effects of the energy
distribution can be explained by the shape of the cell probability of detection curve. First, it is
helpful to consider the problem with only two cells. The probability of detection of the first cell
is $p_1$, and the probability of detection of the second cell is $p_2$. When the energy is uniformly
distributed, $p_1 = p_2 = p$ (the cell probability of detection). The total probability of detection is
given by

$$P = 1 - (1-p)^2 = 2p - p^2 \tag{12}$$

Now, let us assume the energy is not uniformly distributed, with the first cell having lower energy
and the second cell having higher energy. Then $p_1$ becomes $(p - \delta p_1)$, and $p_2$ becomes
$(p + \delta p_2)$. Neglecting the second-order term $\delta p_1\, \delta p_2$, the total probability of detection
becomes

$$P + \delta P = 1 - (1-p_1)(1-p_2) \approx P + (1-p)(\delta p_2 - \delta p_1) \tag{13}$$

therefore

$$\delta P \approx (1-p)(\delta p_2 - \delta p_1) \tag{14}$$

We can see the trend of $\delta P$ from the plot of p vs. signal-to-noise ratio on linear scales.
When the curve is concave-up, $\delta p_2 > \delta p_1$ and the non-uniformly distributed energy will
increase the probability of detection. When the curve is concave-down, $\delta p_1 > \delta p_2$, and the
non-uniformly distributed energy will decrease the probability of detection. The size of the
increment or decrement clearly depends on how large the curvature of the curve of p is, how the
energy is distributed, and how large p is.
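The two-cell argument can be checked numerically with a few lines. The sketch below compares the exact change in the total probability of detection with the first-order expression (1 - p)(dp2 - dp1); the function names and the sample values of p, dp1 and dp2 are hypothetical.

```python
def two_cell_detection(p1, p2):
    """Total probability of detection for two cells: 1 - (1 - p1)(1 - p2)."""
    return 1.0 - (1.0 - p1) * (1.0 - p2)


def first_order_change(p, dp1, dp2):
    """First-order change in detection probability: (1 - p)(dp2 - dp1)."""
    return (1.0 - p) * (dp2 - dp1)


# Hypothetical values: uniform cell probability p, lowered by dp1 in the
# weaker cell and raised by dp2 in the stronger cell (concave-up case).
p, dp1, dp2 = 0.2, 0.05, 0.08
exact = two_cell_detection(p - dp1, p + dp2) - two_cell_detection(p, p)
approx = first_order_change(p, dp1, dp2)
```

Expanding the product shows that the exact change exceeds the first-order expression by exactly dp1 * dp2, so the approximation is adequate when both perturbations are small.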

As an example, Figure 5 plots the cell probability of detection for $Y_b$ values
of 6.903 (large false alarm probability) and 20.72 (small false alarm probability) for single-pulse
Rayleigh scatterers when M = 10. The figure shows that below 23 dB of signal-to-noise ratio the
curve is relatively straight when $Y_b$ = 6.903; therefore, the probability of detection is not much
affected by the energy distribution (see Ref. [2]). For $Y_b$ = 20.72, Figure 5 shows that the
curvature of the curve is relatively large in certain ranges of signal-to-noise ratio. The curve
changes from concave-up to concave-down at about 24 dB. Therefore, the probability of detection
is more significantly affected by the energy distribution in the low-probability range and is not
much affected by the energy distribution near 24 dB. When the signal-to-noise ratio is below
24 dB, non-uniformly distributed energy increases the probability of detection, and when the ratio
is above 24 dB, non-uniformly distributed energy decreases the probability of detection (see
Ref. [2]).

Conclusion 

The formulas involved in calculating radar detection performance have bad
numerical behavior and thus lead to errors in the results. The numerical methods we developed
overcome those difficulties and give much more accurate results. The incomplete Toronto
function, the modified Bessel function, and open-range integration are the main sources of
numerical problems.

For non-uniformly distributed energy, we have found a phenomenon common to every
target model: the effect of the energy distribution becomes more significant as the false alarm
probability approaches zero. However, the effect of the energy distribution also depends largely
on the target model. Constant amplitude scatterers are most seriously affected by the energy
distribution, dominant plus Rayleigh scatterers are next, and Rayleigh scatterers are least
affected. Multi-pulse cases are more seriously affected by the energy distribution than
single-pulse cases, and fast fluctuating targets are more seriously affected than slow
fluctuating targets.

The effect of the energy distribution can be interpreted through a plot of the so-called "cell
probability of detection" vs. signal-to-noise ratio on linear scales. When the curve is concave-up,
non-uniformly distributed energy will increase the probability of detection. When the curve is
concave-down, non-uniformly distributed energy will decrease the probability of detection.
The increment or decrement depends on the curvature of the curve, the energy distribution,
and the cell probability of detection at the given energy-to-noise ratio. In most cases,
non-uniformly distributed energy will increase the probability of detection.

Figure 1.  Detection probability for uniformly distributed energy, 16-pulse constant
amplitude scatterers when false alarm probability is 1.E-8.

Figure 3.  Comparison of Detection Performance between Uniformly Distributed Energy
and Non-Uniformly Distributed Energy for 4-Pulse Constant Amplitude
Scatterers when M = 2 and False Alarm Probability = 1.E-8.

Figure 4.  Comparison of Detection Performance between Different Values of $\sigma$ for
4-Pulse Constant Amplitude Scatterers when $N_l$ = 8, M = 10, $\bar{E}$ = 5 and False Alarm
Probability = 1.E-8.
Abstract


We describe an experimental procedure to characterize optical
nonlinearity and dynamic effects in fibers. The method is based on degenerate
two wave mixing. In this method intensity dependent optical nonlinearity, such
as the nonlinear refractive index, is induced by a high intensity laser beam in the
fiber. The nonlinear effect is measured by probing with a weak probe signal that is
time synchronized with, and has the same frequency as, the pump. Effects such as pulse
broadening and wavelength shift are also measured.
EXPERIMENTAL MEASUREMENT OF NONLINEAR EFFECTS IN FIBERS
BASED ON DEGENERATE TWO-WAVE INTERACTIONS

Eric Donkor

I. Introduction

Optical fiber and fiber-based devices are becoming increasingly important
in applications including amplification, sensing, modulation and switching.

Such applications are derived from a variety of nonlinear optical phenomena
existing in fibers. In order to produce nonlinear fiber devices it is necessary to
measure and characterize the nonlinear properties and nonlinear effects in
materials due to intensity induced or picosecond interactions. The material
properties of interest have very small values. Also, several dynamic effects
including pulse broadening, wavelength shift, scattering, etc. can occur
concurrently and are therefore not easily isolated for measurement.

Laser induced optical nonlinearity such as the optical Kerr effect and its
related dynamics has been characterized extensively because it is the basic
mechanism underlying the operation of most all-optical devices. In principle an
intense pump signal propagating in a Kerr medium induces a third order
polarization. Measurement of the induced nonlinearity can be obtained by
probing with a weak signal. Optical probing methods for characterizing the
nonlinear refractive index and related dynamic effects include the optical phase
conjugate technique, modulation spectroscopy, hole burning spectroscopy, time
resolved methods, photon-echo and magneto-optical techniques. Interferometric
probing methods such as the time resolved technique measure the relative phase
change between probe and pump traversing different optical paths.

Inherent fluctuations and instabilities in the different paths cause uncertainties
in measurements.

The optical phase conjugate method, through degenerate four wave
mixing, is another probing technique. It utilizes a single source and path for the
probe and pump signals and is therefore less susceptible to external influence.
However, special wave vector matching conditions must be satisfied. The
difficulty of experimentally satisfying the wave-vector matching conditions
makes this technique less flexible as a diagnostic method for analyzing optical
nonlinearity and its dynamic effects in materials and devices.

We describe an experimental method for measuring the nonlinear
refractive index and characterizing related dynamic effects in fibers. The
methodology is based on degenerate two wave interactions. A theoretical model
for the technique is described in Section II. The experimental method is discussed
in Section III.


II. Theoretical Model.

Consider two orthogonally polarized pump and probe signals
copropagating in a Kerr medium. Let the peak pump power be Pp and the probe
power be Ps, the probe signal intensity being small compared with the pump. The
intense pump signal induces a nonlinear refractive index change which can give
rise to a shift in the wavelength of the probe signal¹.


¹ P. L. Baldeck, R. R. Alfano, G. P. Agrawal, "Induced-Frequency Shift of Copropagating
Ultrafast Optical Pulses", Appl. Phys. Lett. 52 (23), pages 1939-1941, June 1988.
In the above equation n2 is the nonlinear refractive index, c is the velocity
of light in free space, Aeff is the effective cross-sectional area of the fiber core, and λp
and λs are the pump and probe wavelengths respectively. The bandwidth of the
pump is T0, and the parameter δ = L/Lw, where L and Lw are the fiber length
and the walk-off length respectively. The delay between the pump and probe pulses is
τd. Equation 1 suggests an indirect method for measuring fiber nonlinearity, that
is, through a measurement of the wavelength shift associated with the cross-phase
modulation between probe and pump. If a single frequency source is used
for the pump and probe, the nonlinear refractive index simplifies to:


[Equation 2: expression for n2 involving c, Aeff, T0 and L; only partly legible in the source.]    (2)


A temporal evolution of the nonlinear refractive index can be determined if
there is a time delay between the pump and probe at the input of the fiber
device.


III. Experimental Methods

The experimental setup for measuring the laser induced nonlinear effects
in fibers is shown in Fig. 1. A Lightwave Electronics model 130 pulsed laser was
used as the main laser source for our experiments. The laser outputs about
70 mW with a pulse repetition rate of 250 MHz and a pulse width of 10 ps. An
INRAD MN5-14B autocorrelator was used to monitor the pulses for
mode-locking. The output of the autocorrelator was connected to a Tektronix 2246 scope
for display of the pulses.
Fig. 1 Set-Up For Degenerate Two-Wave Measurement


Fig.  2  Degenerate  Two-Wave  Measurement  Continued 




The subcircuit for tracking the pulses for mode-locking comprises the
autocorrelator, the two infra-red coated mirrors M1 and M2, and the scope. M1 is
fixed but M2 can be translated to reflect the laser beam to M1. M1 is
positioned to reflect the beam a second time so that it enters the autocorrelator.

To observe or adjust the laser for mode-locking, set the time constant on the
autocorrelator between 2 and 3. Move M2 and/or adjust the tilt controls on M1
and M2 for a signal to register on the scope. Tune the laser cavity to obtain
mode-locked pulses. The laser controller allows for tuning the cavity. A monitor
connected to the laser controller displays the settings of the various laser
parameters during tuning. The laser cavity is calibrated in picoseconds and can
be tuned from 1200 ps to 1700 ps. Mode-locking of the laser pulses occurred
between 1320 ps and 1370 ps. Once mode-locking is achieved M2 is removed to
allow the laser source into the rest of the circuit.

A Gould 5:95 2x2 coupler was used to split the source into pump (95%)
and probe (5%) signals. The fiber coupler couples the source into the splitter C1.
Approximately 20 mW of the source signal was coupled into the fiber. The probe
was connected to a Laser Precision Corporation DB 2900 single mode variable
attenuator. The output of the attenuator was connected to a polarization
controller, PC1. The pump was connected to an electro-optic modulator through
the polarization controller, PC2. The output of the modulator was also connected
to a polarization controller, PC3. The 2x1 coupler, C2, was used to combine the
probe and pump signals. A JDS FITEL SN 837 analyzer was used at the output
for differentiating between the probe and pump. The fiber under test (FUT) was
placed between the output of C2 and the analyzer. The fixed analyzer method
requires that the probe and pump signals be orthogonally polarized at both
input and output ports of the FUT. This condition is verified in Experiment 1.


Experiment 1.

The variable attenuator was set to attenuate the probe signal, and the
modulator was then biased for maximum pump power. The controllers PC2 and
PC3 were adjusted for a minimum pump power reading on the meter, and a record of
the analyzer angle was taken. Without adjusting the polarizer, the analyzer was
rotated until the power meter registered a maximum again. The rotation of the
analyzer for maximum pump power was approximately 90 degrees. The probe
signal was then let through with the pump still off. The analyzer was rotated to
the previous angle, and the probe power was recorded.


Results For Experiment 1.

Table 1: Orthogonal Polarization of Pump and Probe

Analyzer Ang.   Probe Att.   Probe Power   Pump Power   Pump Volt.
270°            38 dB        2.0 µW        0.5 µW       2.9 V
180°            38 dB        2.0           4.4 mW       2.9 V
180°            38 dB        2.0 µW        6.0          7.0 V
180°            38 dB        2.0 µW        80.0 µW      7.0 V
270°            3 dB         468.0 µW      80.0 µW      7.0 V
The extinction ratios of the pump and probe calculated from the above table were
39 dB for the pump and 23 dB for the probe. These results show the two signals to
be orthogonally polarized.
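
The quoted extinction ratios can be checked from the Table 1 readings. The sketch below is a hedged illustration, not the author's procedure: it assumes the extinction ratio is ten times the base-10 logarithm of the maximum-to-minimum power ratio, and that the small power readings in Table 1 are in microwatts (the unit symbols are damaged in the source). The function name `extinction_ratio_db` is hypothetical.

```python
import math


def extinction_ratio_db(p_max_watts, p_min_watts):
    """Extinction ratio in dB between maximum and minimum transmitted power."""
    return 10.0 * math.log10(p_max_watts / p_min_watts)


# Table 1 readings, assuming the small-power entries are microwatts.
pump_db = extinction_ratio_db(4.4e-3, 0.5e-6)      # 4.4 mW against 0.5 uW
probe_db = extinction_ratio_db(468.0e-6, 2.0e-6)   # 468 uW against 2.0 uW
print(f"pump: {pump_db:.1f} dB, probe: {probe_db:.1f} dB")
```

Under these assumptions the ratios come out near 39.4 dB and 23.7 dB, in line with the 39 dB and 23 dB reported.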

Experiment 2

In this experiment we studied the dynamic effects of copropagating
pump and probe in a 2 kilometer fiber. The FUT was connected between the 2x1
coupler, C2, and the input to the analyzer. The output of the analyzer was
connected to an optical spectrum analyzer (Anritsu MS 90). The probe and pump
signals were adjusted to be orthogonally polarized at the input of the fiber. The
modulator bias voltage and the attenuator were adjusted to allow for the
transmission of maximum pump and probe signals through the FUT. The spectra
of the output signal were measured by the spectrum analyzer. The spectra of the
probe-only and the pump-only signals were also measured.

Results For Experiment 2

The measured spectra are shown in Fig. 3a-3d. Fig. 3a represents the spectrum
of the probe, Fig. 3b is for the pump, and two spectra of the modulated probe are
shown in Fig. 3c and Fig. 3d. The peaks of the probe-only and the pump-only
signals are obtained from Fig. 3a and 3b, and both occurred at a wavelength of
1.32170 µm, while the peak of the modulated probe signal occurred at 1.32166 µm.
However, the change was on the same order as the resolution of the scope and is
therefore regarded as insignificant. The available pump power was found to be
too small to induce any significant wavelength shift of the modulated probe
signal.
The probe-only and pump-only spectra are similar in that they both
depict similar rise and fall times. The bandwidths of the probe-only and the
pump-only signals are 0.304 nm and 0.273 nm respectively. In contrast, the
modulated probe signals depict remarkably slow rise and fall times. In addition,
the fall times appear to be composed of two time constants, with the tail end
much slower than the leading edge. The bandwidth of the modulated signal is
0.432 nm. This is 35% larger than the pump-only signal and 28% higher than the
probe-only signal.

The results indicate the occurrence of nonlinear effects in the fiber due to
the degenerate interaction of copropagating signals. The pulse
broadening was significant, although due to pump power limitations additional
experiments at higher power could not be performed to identify the specific
mechanisms involved.
CIRCULAR WAVEGUIDE TO MICROSTRIP LINE TRANSITION


Frances  J.  Harackiewicz 
Assistant  Professor 
Department  of  Electrical  Engineering 
Southern  Illinois  University  at  Carbondale 

Abstract 


A  circular  waveguide  to  microstrip  line  transition  was  analyzed  with  a  spectral-domain 
Green  function  /  method  of  moments  technique.  The  goal  of  this  work  was  to  obtain  the 
equations  for  the  moment  method  matrices  as  they  appear  in  Appendix  D.  These  spectral 
integral  expressions  for  the  matrix  elements  take  into  account  all  the  available  symmetries 
in the spectral polar coordinates $k_t$ and $\alpha$. Beyond this goal, much of the analysis was
written  into  numerical  subroutines.  When  run  together  the  subroutines  will  be  a  useful 
design  tool.  This  written  computer  code  will  help  engineers  design  a  reliable  transition 
through  the  cooling  boundary  layer  to  the  rf  circuitry  in  a  high  density  microwave 
monolithic  integrated  circuit  package. 

The  second  goal  of  this  work  was  to  develop  a  postprocessing  routine  which  will  calculate 
the  reflection  coefficient  on  a  microstrip  line  given  sampled  values  of  the  current  on  that 
line.  A  fairly  robust  routine  described  in  the  text  was  developed. 
Introduction 

Presently,  multilayer  microwave  monolithic  integrated  circuits  and  their  associated 
packaging use vertical conducting interconnects to couple the rf signal between layers.
These  vertical  connectors  may  break  and  lose  electrical  contact.  Such  a  break  can  be  very 
difficult  to  fix  if  it  is  embedded  in  the  circuitry.  Solderless  fuzzball  connections  are 
sometimes  used  in  an  attempt  to  overcome  this  problem  in  high  density  packaging,  but 
they  are  neither  trustworthy  nor  satisfactorily  simple  to  construct  or  to  analyze. 

Several  geometries  of  layers  electromagnetically  coupled  without  vertical  interconnects 
are  also  possible  and  some  have  been  studied  [1].  The  general  circular  waveguide  to 
microstripline (cwg2ms) transition is shown in Fig. 1. The special case with $y_s = 0$, $\epsilon_2 = \epsilon_3 = \epsilon_0$
is  studied  here.  This  geometry  was  suggested  by  M.  Davidovitz  [2]  who  built  and 
measured  such  a  transition  which  had  a  good  VSWR  response  over  a  promising  frequency 
range.  The  cwg2ms  transition  should  be  useful  in  coupling  to  circuits  with  thick  ground 
planes  used  as  a  heat  sink  or  as  a  boundary  between  liquid  N2  cooled  circuitry  and  the 
remaining  circuitry  at  ambient  temperature.  A  hole  drilled  in  the  thick  metal  ground  plane 
is  easily  machined  and  becomes  the  circular  waveguide. 

The  goal  of  this  research  is  to  do  the  analysis  necessary  prior  to  writing  an  efficient 
computer  code  to  analyze  the  cwg2ms  transition.  The  code  should  be  fast  enough  to  be 
used  by  engineers  as  an  interactive  design  tool  and  accurate  to  within  1%  to  3%  of  the 
experimental  results.  The  code  should  be  portable  and  independent  of  any  non-public- 
domain  libraries  except  for  the  programming  language  FORTRAN  77. 
Top view:

The aperture exists at $\rho < a$, $0 < \phi < 2\pi$, $z = 0$.

The substrate $\epsilon_1$ exists for $-\infty < x < \infty$, $-\infty < y < \infty$, $0 < z < d$.

The plug $\epsilon_2$ exists for $\rho < a$, $0 < \phi < 2\pi$, $-d_2 < z < 0$.

The $\epsilon_3$ waveguide filling exists for $\rho < a$, $0 < \phi < 2\pi$, $z < -d_2$.

Figure 1 - Geometry of the Circular Waveguide to Microstrip line Transition

Methodology

The  spectral-domain  Green  function  /  Method  of  Moments  (MoM)  is  used  to  analyze  the 
cwg2ms  transition.  Waveguide  modes  are  used  to  expand  the  equivalent  magnetic  current 
over  the  ground  plane  at  the  waveguide  aperture  A,  and  overlapping  triangular  pulses 
(rooftop-modes)  are  used  to  expand  the  induced  electric  current  on  the  microstrip  line. 
The  two-dimensional  spectral  integrals  are  done  numerically  using  Chebyshev  and 
Gaussian  quadrature  routines.  The  Green  functions  relate  the  vector  fields  and  sources  on 
the  top  and  bottom  boundaries  of  the  dielectric  substrate  of  the  microstrip  line.  The 
integral  equations  and  moment  method  matrix  elements  are  defined  in  Appendix  D. 

After the MoM procedure is used to obtain the current on the strip S, some
postprocessing is necessary to obtain the S parameters of the transition. A postprocessing
method similar to finding Fourier coefficients was developed for this purpose and is
summarized in the next section. Since this technique involves numerical integration, one
can assume that the method is more robust than one that has no "averaging" in it.

[Figure 1, left side view: z is the waveguide axis and x is the direction of the line.
The strip exists for $x_s < x < x_s + L$, $y_s - w/2 < y < y_s + w/2$, $z = d$.]

Summary  of  new  Postprocessing  Technique 

The current on the strip is known as a series of weighted rooftop functions from the
MoM, yet the true current is assumed to be of the form

$$I(x) = A^{+} e^{-j\beta x}\left(1 + R\,e^{j(\phi + 2\beta x)}\right)$$

The postprocessor must find the values of $A^{+}$ (magnitude and phase), $R$ (real), and $\phi$ (real)
while assuming that $\beta$ is known by some other means such as

$$\beta = 2\pi/\lambda_g$$

The rooftop sample points are spaced every $\lambda_g/(2N)$ from $x = 0$ to $x = \lambda_g/2$ where $N$ is an
even number between 3 and 20. So, the sample points are

$$x_i = \frac{i\lambda_g}{2N} \quad \text{for } i = 0, 1, 2, \ldots, N.$$

Three of the unknowns can be determined by first considering only the magnitude squared
of the current

$$|I(x_i)|^2 = |A^{+}|^2\left[1 + R^2 + 2R\cos(\phi + 2\pi i/N)\right]$$

where the ranges of the unknowns are
$|I(x_i)|/2 \le |A^{+}| < \infty$, $0 \le R \le 1$, $0 \le \phi < 2\pi$.

Three coefficients are defined and can be calculated from the current at the sample points
by using Simpson's integration rule [4]:
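$$a_1 = \frac{6N}{\lambda_g}\int_0^{\lambda_g/2} |I(x)|^2\,dx = 3N|A^{+}|^2(1 + R^2)$$
$$\phantom{a_1} = |I(x_0)|^2 + 4|I(x_1)|^2 + 2|I(x_2)|^2 + \cdots + 4|I(x_{N-1})|^2 + |I(x_N)|^2$$

$$a_2 = \frac{6N}{\lambda_g}\int_0^{\lambda_g/2} |I(x)|^2\cos(2\beta x)\,dx = 3N|A^{+}|^2 R\cos\phi$$
$$\phantom{a_2} = |I(x_0)|^2\cos(2\beta x_0) + 4|I(x_1)|^2\cos(2\beta x_1) + 2\cdots + 4|I(x_{N-1})|^2\cos(2\beta x_{N-1}) + |I(x_N)|^2\cos(2\beta x_N)$$

$$a_3 = \frac{6N}{\lambda_g}\int_0^{\lambda_g/2} |I(x)|^2\sin(2\beta x)\,dx = -3N|A^{+}|^2 R\sin\phi$$
$$\phantom{a_3} = |I(x_0)|^2\sin(2\beta x_0) + 4|I(x_1)|^2\sin(2\beta x_1) + 2\cdots + 4|I(x_{N-1})|^2\sin(2\beta x_{N-1}) + |I(x_N)|^2\sin(2\beta x_N)$$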

Then solve for $\phi_0$, $R_0$, and $A_0$ using the above a's.

$$\phi \approx \phi_0 = \arctan(-a_3/a_2)$$

The atan function in FORTRAN returns a $\phi_0$ between $-\pi/2$ and $\pi/2$. So, the following
contingencies must be placed into the calculation:

    if φ0 < 0 then
        if a2 < 0 then
            φ0 = φ0 + π
        else if a2 > 0 then
            φ0 = φ0 + 2π
        end if
    else
        if a2 < 0 then
            φ0 = φ0 + π
        end if
    end if
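
The quadrant contingencies can be compared against a two-argument arctangent. The following is a minimal sketch under one reading of the contingencies (add π whenever a2 is negative, add 2π when the angle is negative and a2 is positive); the function names are hypothetical, and the case a2 = 0 is not handled by the single-argument form.

```python
import math


def corrected_phase(a2, a3):
    """Single-argument arctangent followed by the quadrant contingencies."""
    phi0 = math.atan(-a3 / a2)      # lies in (-pi/2, pi/2); requires a2 != 0
    if phi0 < 0:
        if a2 < 0:
            phi0 += math.pi
        elif a2 > 0:
            phi0 += 2 * math.pi
    elif a2 < 0:
        phi0 += math.pi
    return phi0


def atan2_phase(a2, a3):
    """Same angle from the two-argument arctangent, mapped to [0, 2*pi)."""
    return math.atan2(-a3, a2) % (2 * math.pi)


# One test angle per quadrant; a2 ~ R cos(phi) and a3 ~ -R sin(phi).
for phi in (0.3, 1.8, 3.5, 5.5):
    a2, a3 = math.cos(phi), -math.sin(phi)
    print(phi, corrected_phase(a2, a3), atan2_phase(a2, a3))
```

Both routes return the original angle in each quadrant; the two-argument form also tolerates a2 = 0.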

To solve for $R_0$ define $Q$ as below, which leads to a quadratic equation.

$$Q = \frac{R^2}{(1 + R^2)^2} = \frac{a_2^2 + a_3^2}{a_1^2}$$

and note that $0 \le Q \le 1/4$. So,

$$Q R^4 + (2Q - 1)R^2 + Q = 0$$

which can be solved by letting $q = 1$ and $p = 1 - 1/(2Q)$. Note that
$-\infty < p \le -1$ so that sgn(p) = -1. Solving the quadratic leads to [5]

$$R_1^2 = -p - \mathrm{sgn}(p)\sqrt{p^2 - q} \quad \text{if } p^2 \ge q \text{ and } p \ne 0$$
$$R_2^2 = q / R_1^2$$

Then $R_0^2 = R_1^2$, but if $R_1^2 > 1$ then $R_0^2 = q / R_1^2$, and $R_0 = \sqrt{R_0^2}$.
The code calculating $R_0$ must also be written with contingencies to avoid two possible
numerical problems. Notice that $p^2 \ge q = 1$ will always be true. A numerical problem
arises if $p^2 - q$ should be zero, but is a small negative number instead due to roundoff error.
Another numerical problem could arise if $Q = 0$ and then $p = -\infty$, $R = 0$. So, following is the
corresponding code

    q = 1
    if (Q = 0) then
        R0 = 0
    else
        if p² < q then
            R0² = -p
        else
            R0² = q / (-p + √(p² - q))
        end if
    end if

To calculate the amplitude use $|A^{+}| \approx A_0 = \sqrt{\dfrac{a_1}{3N(1 + R_0^2)}}$.
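
The steps of the postprocessing technique can be put together in a short script. This is a sketch in Python rather than the FORTRAN 77 of the report, and it makes several assumptions: the current has the standing-wave form given earlier, β = 2π/λg, N is even, and a two-argument arctangent stands in for the atan call plus quadrant contingencies. The names `model_current`, `simpson_weights` and `postprocess` are hypothetical.

```python
import cmath
import math


def model_current(x, a_plus, refl, phi, beta):
    """Assumed form I(x) = A+ exp(-j beta x) (1 + R exp(j(phi + 2 beta x)))."""
    standing = 1.0 + refl * cmath.exp(1j * (phi + 2.0 * beta * x))
    return a_plus * cmath.exp(-1j * beta * x) * standing


def simpson_weights(n_intervals):
    """Composite Simpson weights 1, 4, 2, ..., 4, 1 (n_intervals must be even)."""
    if n_intervals < 2 or n_intervals % 2:
        raise ValueError("Simpson's rule needs an even number of intervals")
    return [1 if i in (0, n_intervals) else 4 if i % 2 else 2
            for i in range(n_intervals + 1)]


def postprocess(samples, guide_wavelength):
    """Estimate (A0, R0, phi0) from N+1 samples spanning half a guide wavelength."""
    n = len(samples) - 1
    beta = 2.0 * math.pi / guide_wavelength
    xs = [i * guide_wavelength / (2.0 * n) for i in range(n + 1)]
    power = [w * abs(s) ** 2 for w, s in zip(simpson_weights(n), samples)]

    # Simpson sums a1, a2, a3 of |I|^2, |I|^2 cos(2 beta x), |I|^2 sin(2 beta x).
    a1 = sum(power)
    a2 = sum(p * math.cos(2.0 * beta * x) for p, x in zip(power, xs))
    a3 = sum(p * math.sin(2.0 * beta * x) for p, x in zip(power, xs))

    phi0 = math.atan2(-a3, a2) % (2.0 * math.pi)

    big_q = (a2 * a2 + a3 * a3) / (a1 * a1)
    if big_q == 0.0:
        r0_squared = 0.0
    else:
        p, q = 1.0 - 1.0 / (2.0 * big_q), 1.0
        disc = p * p - q
        # Guard against a slightly negative discriminant caused by roundoff.
        r0_squared = -p if disc < 0.0 else q / (-p + math.sqrt(disc))

    a0 = math.sqrt(a1 / (3.0 * n * (1.0 + r0_squared)))
    return a0, math.sqrt(r0_squared), phi0


# Synthetic check with hypothetical values: |A+| = 2, R = 0.5, phi = 1.8.
N, LAMBDA_G = 10, 1.0
BETA = 2.0 * math.pi / LAMBDA_G
samples = [model_current(i * LAMBDA_G / (2 * N), 2.0 * cmath.exp(0.3j), 0.5, 1.8, BETA)
           for i in range(N + 1)]
a0, r0, phi0 = postprocess(samples, LAMBDA_G)
print(a0, r0, phi0)
```

For noiseless synthetic samples with N = 10 the three estimates should match the inputs to within roundoff; the phase of A+ is not recovered by this routine.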

By the sampling theorem it seems that 4 sample points per $\lambda_g$ should be sufficient, but after
running several test cases it was found that at least 6 sample points were necessary.
Perhaps these extra points are needed because we are using Simpson's Rule for integration
rather than a trapezoidal rule. Table 1 gives examples of given $|A^{+}|$, $R$, and $\phi$ and the
corresponding calculated $A_0$, $R_0$, and $\phi_0$ for a given number $2N$ of samples per guide
wavelength.


Table 1 - Examples of actual and approximated values from postprocessing routine.

2N    |A+|   R    φ      A0        R0         φ0
20    1      1    1.8    1.00000   1.00000    1.80000
6     1      1    1.8    1.00017   0.999655   1.80000
4     1      1    1.8    1.08089   0.917337   3.14159
The phase of $A^{+}$ also needs to be calculated from a sample point of I(x), which includes
phase information.

$$e^{j\phi_A} = \frac{I(x)\,e^{j\beta x}}{|A^{+}|\left(1 + R\,e^{j\phi} e^{j2\beta x}\right)}$$


Planning  the  computer  code 

One  way  to  construct  the  flowchart  of  the  main  program  and  some  of  its  subroutines  is 
sketched  below. 


Filling  the  MoM  matrix  involves  doing  several  two-dimensional,  infinite,  spectral  integrals 
and  is  the  most  time-consuming  part  of  the  numerical  calculation.  The  polar  spectral 
variables are $\alpha$ and $k_t$ and care must be taken so that integrating over them is accurate and
efficient. On the $k_t/k_0$ axis between 1 and $\sqrt{\epsilon_{r1}\mu_{r1}}$ there will be poles of the Green
functions if there is no loss in the substrate. If there is some loss in the substrate, then the
poles will move into the complex plane but still be close enough to the real axis to cause
some rapid changes in the integrands evaluated on the real axis. One must choose
the sample points and integration method carefully to best fit this type of function. One
possible way to construct the flowchart for the MoM matrix fill is shown below.


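There are three interactions AA, AS, and SS which correspond to three different types of
integrals which must be done over $k_t$ and over $\alpha$. The two-dimensional range of
integration can be illustrated with the area of a rectangle shown below.

[Figure: rectangular region of integration, with $\alpha$ running from 0 to $\pi/2$ and
$k_t/k_0$ running from 0 through $\sqrt{\epsilon_{r1}\mu_{r1}}$ to large.]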
The $\alpha$ integration is done analytically for the AA interaction, but must be done numerically
for the other interactions. The $k_t$ integration from 0 to 1 can be done in one step with a 64
point Gaussian quadrature routine for all three interactions. The $k_t$ integration from 1 to
$\sqrt{\epsilon_{r1}\mu_{r1}}$ must be done in a number of steps using a grid that is sensitive to the regions
where the Green functions have poles. All three interactions have poles at the same
locations, but some Green function elements are worse behaved than others. To determine
the grid to use, a public library quadrature routine with an adaptive grid is first used to
integrate over the worst behaved Green function. By testing it was found that the
integrand for this preliminary integral should be the imaginary part of hmtm if only one
surface-wave pole exists and the imaginary part of hmtm times the imaginary part of hmte
if more than one surface-wave pole exists. The value of large is different for each
interaction. Its value can be guessed at from the electrical dimensions between bases
involved for each interaction. Then large can be increased from those points until
convergence of the final answer is achieved. For the AS and SS interactions, the $\alpha$
integrations must also be done numerically. A Chebyshev integration suitable for
integrating over sinusoids is used in this dimension. The integration grid in the $\alpha$ direction
is always found from the worst case (largest $k_t$ and largest variation of the integrand) $k_t$
point for each particular set of 64 $k_t$ points. Much of this code and the MoM equations are
similar to those in [3].
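
The idea of a grid that is sensitive to pole locations can be illustrated with a toy integrand. The sketch below is not the routine described in this report: it uses a 5-point Gauss-Legendre rule per panel instead of the 64-point rule, assumes the pole location is already known, and substitutes a Lorentzian for a Green function element near a slightly lossy pole. All names and numerical values are hypothetical.

```python
import math

# 5-point Gauss-Legendre nodes and weights on [-1, 1].
GL5_NODES = (-0.9061798459386640, -0.5384693101056831, 0.0,
             0.5384693101056831, 0.9061798459386640)
GL5_WEIGHTS = (0.2369268850561891, 0.4786286704993665, 0.5688888888888889,
               0.4786286704993665, 0.2369268850561891)


def gauss_panel(f, a, b):
    """5-point Gauss-Legendre estimate of the integral of f over [a, b]."""
    half, mid = 0.5 * (b - a), 0.5 * (a + b)
    return half * sum(w * f(mid + half * x) for x, w in zip(GL5_NODES, GL5_WEIGHTS))


def graded_breakpoints(a, b, pole, width):
    """Panel edges on [a, b] that shrink geometrically toward `pole` (a < pole < b)."""
    edges = {a, b, pole}
    step = 0.5 * width
    while pole - step > a or pole + step < b:
        if pole - step > a:
            edges.add(pole - step)
        if pole + step < b:
            edges.add(pole + step)
        step *= 2.0
    return sorted(edges)


def integrate_graded(f, a, b, pole, width):
    """Sum of panel integrals over the graded grid."""
    edges = graded_breakpoints(a, b, pole, width)
    return sum(gauss_panel(f, lo, hi) for lo, hi in zip(edges, edges[1:]))


# Stand-in integrand: sharp peak of half-width DAMPING centered at POLE.
POLE, DAMPING = 1.2, 1.0e-3


def lorentzian(x):
    return DAMPING / ((x - POLE) ** 2 + DAMPING ** 2)


A, B = 1.0, 1.5
exact = math.atan((B - POLE) / DAMPING) - math.atan((A - POLE) / DAMPING)
approx = integrate_graded(lorentzian, A, B, POLE, DAMPING)
print(exact, approx)
```

Grading the panels toward the peak keeps each panel short relative to its distance from the pole, so a low-order rule per panel is enough; a uniform grid with the same number of panels would place only a few samples inside the peak.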

Conclusion 

A computer-aided design tool for a transition reliable in high density feeds for phased
array antennas is desired. The circular waveguide to microstripline transition can be
analyzed with the spectral-domain / method of moments. All of the analytical work necessary
to implement this analysis method in a computer program has been done. This analytical
work included deriving the moment method matrix element equations and developing a
new postprocessing technique. The postprocessing technique was programmed and
verified separately, as were the routines calculating the Green function elements and the
Fourier transforms of the basis functions. The individual pieces of code work well. A plan
for linking the pieces of code together has been worked out. Future work will
include piecing together the code into one analysis program and comparing the results to
the experimental data.

The  following  three  figures  show  what  the  Green  function  elements  and  the  basis  functions 
look  like  in  the  spectral  domain. 
Figure 2 - Real and imaginary parts of the normalized Green function elements from
Appendix A. (Panel labels: Re(Green fncs. normalized), Im(Green fncs. normalized).)

Figure 3 - Magnitude and phase of the Fourier transforms of a few of the 20 roof-top basis
functions on the strip. (Horizontal axis: alpha (degrees).)

Figure 4 - Components of $b(k_t)$ for the first three modes of a circular waveguide. These
functions are defined in Appendix B. (Curve labels: btkt, btzxkt.)
Appendix A Green Functions

Listed  below  are  the  spectral-domain  dyadic  Green  functions  used  in  the  program.  The 
two  superscripts  above  the  G  refer  to  the  type  of  observation  field  and  source  current. 

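$$\bar{\bar{G}}^{HJ}(0,d_1) = \hat{k}_t\,(hjte)\,\hat{k}_t\times\hat{z} + \hat{z}\times\hat{k}_t\,(hjtm)\,\hat{k}_t$$

$$\eta_0\,\bar{\bar{G}}^{HM}(0,0) = \hat{k}_t\,(hmte)\,\hat{k}_t + \hat{z}\times\hat{k}_t\,(hmtm)\,\hat{z}\times\hat{k}_t$$

$$\bar{\bar{G}}^{EM}(d_1,0) = \hat{k}_t\times\hat{z}\,(emte)\,\hat{k}_t + \hat{k}_t\,(emtm)\,\hat{z}\times\hat{k}_t$$

$$\bar{\bar{G}}^{EJ}(d_1,d_1)/\eta_0 = \hat{k}_t\,(ejtm)\,\hat{k}_t + \hat{k}_t\times\hat{z}\,(ejte)\,\hat{k}_t\times\hat{z}$$

$$hjte = -I^{TE}(0,d_1)$$
$$hjtm = -I^{TM}(0,d_1)$$
$$hmte = -\eta_0 I^{TE}(0,0)$$
$$hmtm = -\eta_0 I^{TM}(0,0)$$
$$emte = -hjte$$
$$emtm = -hjtm$$
$$ejtm = -V^{TM}(d_1,d_1)/\eta_0$$
$$ejte = -V^{TE}(d_1,d_1)/\eta_0$$

$$V(d_1,d_1) = 1/\left(Y_0 - jY_1\cot(k_{z1}d_1)\right)$$

$$I(0,d_1) = jY_1 V(d_1,d_1)/\sin(k_{z1}d_1)$$

$$I(0,0) = -jY_1\left(Y_0\cot(k_{z1}d_1) + jY_1\right)V(d_1,d_1)$$

$$k_{z0}/k_0 = \pm\sqrt{1 - (k_t/k_0)^2}, \quad \mathrm{Im} < 0$$
$$k_{z1}/k_0 = \pm\sqrt{\epsilon_{r1}\mu_{r1} - (k_t/k_0)^2}, \quad \mathrm{Im} < 0$$

$$\eta_0 Y_0^{TE} = (k_{z0}/k_0)$$

$$\eta_0 Y_0^{TM} = (k_0/k_{z0}) \quad \text{if } k_{z0} \ne 0$$

$$\eta_0 Y_1^{TE} = k_{z1}/(\mu_{r1}k_0)$$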
Appendix B Basis Functions in the Aperture

The  tangential  electric  field  in  the  waveguide  can  be  written  as 

£:™v(p,(l))  =  e„(p,(t))  (V  /  m) ,  and 

n=l 

the magnetic field in the aperture of the waveguide ($a > \rho$, $2\pi > \phi \ge 0$, $z = 0$; aperture A) is

_  1  _  ^  M  _ 

Htan=—zx  Etan  =  X =  "X (K  +  K)hn  (V  /  m) 


where  the  impedances  of  the  waveguide  modes  are 


V  \K<)  ^ 

where  and  ti,  =  VmT^ 

and  the  transformed  waveguide  modes  are  given  by 

K^i™  =zx  k,c^^  ia)B™  ik,)-zxk, 

kX^^  =  k,c„^ {a)bl\k, ) •  ^,  +  £ X {a)b^{k, )-zxk, 


K'K 


(XT-C 


zxk, 


br=«jj,u'M~K‘‘) 

k„ 


cos(ma) 


c„xa)=r-‘2K\  . ;  ; 

[sin(Ana) 


~ii,  v'  I.  f  1 

k  a  ^  \  t  oj 

l  j  [kMl 


coscj)  dependence 


sin(|)  dependence 
feosej)  dependence 
[sin<t)  dependence 
Appendix C Basis Functions on the Strip


The microstrip line's left edge is centered at $(x_s, y_s)$. The line has width w and length L.
The electric surface current on the strip is expanded in triangular-pulse functions given by

—  — for  Ix-x  -/n/il< /land  ly-ylov/l 

[  0  else 

The  Fourier  transform  of  the  pulses  is 


$$P_y(k_y) = \mathrm{sinc}(k_y w/2)$$

$$k_x = k_t\cos\alpha, \quad k_y = k_t\sin\alpha$$

Appendix  D  Method  of  Moments  Equations 

The  tangential  fields  in  the  waveguide  and  waveguide  aperture  are 

E  TAN  ip  A  z)  =  e„ipA  (V  /  m) 

n=l 

H  TAN  =  - Z  X  EtAN 

M5(p,(|),0)  =  -£x£r^^(z  =  0)  =  -X(K' +K0zx^'' 


The  current  on  the  strip  is  Js  =  J n{x,y,d)  (A  /  m) . 


The given coefficients $V_n^{+}$ describe the incident field from the waveguide. The unknown
coefficients $V_n^{-}$ can be solved by using the spectral-domain Galerkin method on the
boundary conditions (MFIE and EFIE)

(*) $H_{tan}(z = 0^{+}) = H_{tan}^{wg}(z = 0^{-})$ in aperture A

(**) $E_{tan}(z = d) = 0$ on the strip S
or


FT(*) 

FT(**) 


=  ™  ^  =HJ  =  ^  M  ,  U 

Goo  •m  +  Go.-7,  =  //,^(z  =  o-)  =  £(\/'-v;)^ 


rt=l 


=  j=.  =Ey  - 

Grfo  •  +  G(/f/  •  =  0 


where  ~  means  Fourier  Transform  defined  as  /  =  JJ  /e-'*'  ^  Jp  . 

After  Galerkin  weighting  the  MoM  matrix  equation  showing  units  and  dimensions  is 


Xh 

._K_ 

{NxM)x\ 


-AS 


r~  n, 

zV 

Ao 


f  5  ^ 

I  mn 


V 


-T 


5A 


-I 


Tl( 


y  5  ^ 

y  AA  ^  mn 

V  y 

y  5A 


"TvT 

A/xl 


(1) 

(A^+iW)x(Af+A^) 


where  the  submatrices  are  defined  as 


-AS 


/'SS 

‘*mn  /  —  , 


■K^  Jj  (jto/  ”  A  Ao  Jv^^V 


(l)MxM 

(l)MxN 

(l)NxM 

(l)NxM 

(l)NxN. 


'^0  47C^ 

Most terms are made dimensionless by dividing by either

$k_0 = \omega\sqrt{\mu_0\epsilon_0}$ or $\eta_0 = \sqrt{\mu_0/\epsilon_0}$.

The double infinite integrals can be done in polar spectral coordinates where
0  if  nip  ^

orXp  ^TE 

r,  =]  orx^^TE 

n  else 

0  if  nip  ^  m^  or 

if  Xp  ^x ^  and pcos^  =  qcos^ 

-K  elseifXp  5^x^  and/7Cos(|)  (?cos(t)  and/7^„3^  =TE^ 

K  else 

Only  the  upper  triangle  of  the  matrix  including  the  diagonal  need  be  calculated  since  this 
matrix  is  symmetric: 

TloC=^oC- 

The  MxN  matrix  T: 
JAS  _  7  k,


rn\ 


=  HJ 


bp  {k,  )k^  ■k\k,-God  -k^xz  /J,  {k, ) 


V'^o  A  •  £  X  fc,[  £  X  fc,  ■  Gw  •  ^, 

where  C^  =  271/™“' ’^2 -6 


)Uk) 


and  where 


Om 


c =(-')■'■' j 

0 


2Kf  qTM 


if  p  is  TE  and  cos(|)  harmonic 
if  p  is  TM 

4  jRe{F^^^o}^>,  sin^mpa^sinctcia  if  p  is  TE  and  sin(|)  harmonic  and  mp  is  odd 


It/2 


It/2 

■4j  J ^0  }^v,  sin^m^a^sin ada  if  p  is  TE  and  sin(j)  harmonic  and 


mp  IS  even 


PI 


27C, 

f 

Ym  \ 

J 

0 

y  '”p  J 

F^k^F^.  cosou/a 


=  I  ±4  J RejFj^^feo  jF,.^  cos^/TZpa^cosocJa 
0 

k(2 

J cos^/nptt^cosaJa 


if  p  is  TE  and  cos(j)  harmonic 

if  p  is  TM  and  sine])  harmonic 
f  +if  p  is  TM,  cos(|)  harmonic  and  mp  is  odd 

|-if  p  is  TE  and  sinej)  harmonic  and  mp  is  odd 
f-ifp  is  TM  and  cos(()  harmonic  and  mp  is  even 
l+if  p  is  TE  and  sin(|)  harmonic  and  mp  is  even 


The  two  T  matrices  are  related  with  a  transpose  and  a  minus  sign  Tp^  =  —T^p  which 
comes  from  the  two  Green  functions  being  related  G™  =  (-G"^  Y . 

The  NxN  matrix  Z: 


Z^^/TI 

mn  \ 


I 


(kA 

(kA 

d 

J 

yko  J 

Kn 

■  J  rfajeos^  a(ejtm)  +  sin ^(e7re)}cos(^o/jcosa(n - /n))sin c* (— cosa/z^o )sin c^ik^ 
0  2 

Notice  that  Z  has  Toeplitz  symmetry  so  that  only  the  first  row  of  elements  need  be 
calculated. 
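
The saving offered by the Toeplitz structure can be sketched as follows. This is a minimal illustration only: it assumes that Z is also symmetric (the element depends only on |n - m|, as the cosine factor in the integrand suggests), and `first_row_element` is a hypothetical stand-in for the numerical integration of a single element.

```python
import numpy as np

def first_row_element(k):
    """Hypothetical stand-in for the integral giving Z[0, k] (k = column offset)."""
    return 1.0 / (1.0 + k) ** 2

def toeplitz_from_first_row(n, element):
    """Build an n x n symmetric Toeplitz matrix from only n element evaluations."""
    row = np.array([element(k) for k in range(n)])
    # Entry (m, n) depends only on the offset |n - m|.
    offsets = np.abs(np.arange(n)[:, None] - np.arange(n)[None, :])
    return row[offsets]

Z = toeplitz_from_first_row(4, first_row_element)
```

Only N integrals are evaluated instead of N^2; every other entry is copied from the first row.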


sin  a — )
2

Simulation of Erbium-doped Fiber Lasers


J.  W.  Haus 
Physics  Dept, 

Rensselaer  Polytechnic  Institute 
Troy,  NY  12180-3590 


Abstract 


Numerical simulation of pulse amplification in birefringent optical fibers
has been examined with the goal of providing a realistic and accurate
description of passive mode-locking. Our studies emphasized the
evolution of the pulse's polarization. We found a complex polarization state
of the amplified pulse; this has important implications for the operation of
fiber lasers that use a birefringent fiber amplifier as the mode-locking element.

I. INTRODUCTION


Erbium-doped fiber lasers are a potential ultrashort pulse source for optical
communications technologies [1]. They provide high repetition rates in the wavelength regime
corresponding to the minimum loss of silica fibers. Such lasers have distinct advantages;
for instance, they can be pumped using diode lasers and they are compatible with fiber
technologies. However, they suffer from stability and reproducibility problems because of
their complex dependence on the polarization of the light in the laser cavity [2,3]. Recently,
this problem has been examined and new cavity designs have been proposed that operate with
ultrashort pulses while being insensitive to environmental changes [5-12]. The new cavity
designs incorporate birefringent fiber amplifiers as elements to control the polarization state
of the light in the fiber. Polarization mode locking combines birefringent fibers
with polarization elements to create a fast saturable-absorber-like action.

Generally, when a soliton pulse with a well-defined polarization state propagates
through a birefringent fiber, cross- and self-phase modulation act, in addition to the
linear birefringence of the fiber, to rotate the polarization state of the pulse. When
that fiber is an amplifier, though, the perturbation of the pulse phase and amplitude can
be large enough that the polarization varies across the pulse. In this situation, the
mode-locking element no longer functions as designed. The result will be a pulse that
is distorted from its original shape.

II.  MODEL  AND  SOLUTION  METHOD 

We simulated a model for erbium-doped, birefringent fibers and examined the evolution
of the pulses in the amplifier. The simulation model incorporated features of the birefringent
fibers [13] and amplification [14]. The simplest form of the evolution equations is expressed
as:

i  (^  +  A^)  +  E,{\E,\^  +  (1) 



(2) 


+  mEy?  +  IIB.P)  =  iTE,  J- 


$E_x$ and $E_y$ are the electric field vector components along the x- and y-axes, respectively;
propagation is along the z-axis. To simplify the expressions, the equations have been scaled
to soliton units throughout [15]. The time is scaled to a value $T_0$, related to the initial pulse
width; the length is scaled by the dispersion length $L_D = T_0^2/|\beta_2|$, where $\beta_2$ is the group
velocity dispersion parameter. The field amplitude scaling corresponds to a fundamental
soliton, $n_2|E_0|^2 = |\beta_2|/T_0^2$. The left-hand side of Eqs. 1 and 2 includes the scaled inverse
group velocity difference, $2\Delta$, between the two polarizations, and the effect of self- and
cross-phase modulation [15,16]. We have neglected the four-wave mixing terms, which rapidly
oscillate with propagation distance in a birefringent fiber and therefore average to zero.

On the right-hand side, the gain and gain dispersion coefficients are modeled after a
two-level atom model for the medium with a rapid polarization relaxation time. The gain
and gain dispersion parameters are $\Gamma$ and $\mu$, respectively. The equations were solved using a
split-step fast Fourier transform method, which was modified to take advantage of the vector
nature of the equations [17]. The linear contributions are incorporated into a linear operator that
can be solved in the frequency domain. The nonlinear contributions are treated in the time
domain. We generally used 4096 FFT points, but runs have been made with 8192 FFT
points  to  assure  accuracy.  The  step  size  along  the  z-  axis  is  10"^  and  our  time  window 
has  varied  from  80  to  160  units.  Our  results  were  tested  against  previous  results  for  special 
cases. The input pulse shape, i.e., the field at z = 0, is a fundamental soliton with, in general,
unequal amplitudes along the x- and y-axes:

$$E_x(0,\tau) = \cos\theta\,\mathrm{sech}(\tau), \qquad E_y(0,\tau) = \sin\theta\,\mathrm{sech}(\tau). \qquad (3)$$
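
The split-step procedure described above can be sketched as follows. This is a minimal illustration, not the code used for the reported results. It assumes that the coupled equations take the form i(dE_x/dz + Delta dE_x/dtau) + (1/2) d^2E_x/dtau^2 + (|E_x|^2 + B|E_y|^2) E_x = i Gamma (E_x + mu d^2E_x/dtau^2), with Delta replaced by -Delta and x and y exchanged for E_y. The cross-phase coefficient B = 2/3 (the usual value for linearly birefringent fibers), the exact form of the gain terms, the function name `propagate`, and the default grid and step size are all assumptions made for this example.

```python
import numpy as np

def propagate(theta, delta, gain, mu, z_end, dz=1e-3, n=1024, window=80.0, xpm=2.0 / 3.0):
    """Symmetric split-step Fourier integration of two coupled envelopes (soliton units)."""
    tau = (np.arange(n) - n // 2) * (window / n)
    omega = 2.0 * np.pi * np.fft.fftfreq(n, d=window / n)
    ex = np.cos(theta) / np.cosh(tau) + 0j   # input soliton split between the two axes
    ey = np.sin(theta) / np.cosh(tau) + 0j
    # Linear operators in the frequency domain: dispersion, gain, gain dispersion,
    # and walk-off with opposite signs for the two axes (assumed form, see lead-in).
    base = -0.5j * omega**2 + gain * (1.0 - mu * omega**2)
    half_x = np.exp((base - 1j * delta * omega) * dz / 2.0)
    half_y = np.exp((base + 1j * delta * omega) * dz / 2.0)
    for _ in range(int(round(z_end / dz))):
        ex = np.fft.ifft(half_x * np.fft.fft(ex))
        ey = np.fft.ifft(half_y * np.fft.fft(ey))
        ix, iy = np.abs(ex) ** 2, np.abs(ey) ** 2
        # Nonlinear step in the time domain: self- and cross-phase modulation.
        ex = ex * np.exp(1j * (ix + xpm * iy) * dz)
        ey = ey * np.exp(1j * (iy + xpm * ix) * dz)
        ex = np.fft.ifft(half_x * np.fft.fft(ex))
        ey = np.fft.ifft(half_y * np.fft.fft(ey))
    return tau, ex, ey
```

With `gain=0` and `theta=0` the sketch reduces to the scalar nonlinear Schrödinger equation, for which the sech pulse should keep its shape; that special case is a convenient check of the kind mentioned in the text.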

Thus, the starting pulse is linearly polarized at $\theta = \pi/4$ or $\pi/8$ with respect to the x-axis.
To rotate the polarization of the pulse in a birefringent fiber, the input wave should have
different amplitudes along each principal axis.

III. RESULTS


In this section some new results are illustrated; they should suffice to demonstrate the
capabilities of the simulation. Figure 1 plots the intensity along each polarization
axis after 1 and 2 dispersion lengths for $\theta = \pi/4$; the group velocities are identical for
this case. The pulse is amplified and compressed as it attempts to retain its fundamental
soliton character during propagation; the pulse width is limited by the gain dispersion, and
eventually the pulse breaks up as its energy develops higher-order soliton effects. This behavior is
analogous to that found by Agrawal for a single polarization [14]. The simulations without
the polarization have not previously examined the phases of the fields. In the new simulations
this property can be examined. The phase of the pulses in Figure 2 is strongly modulated
during the amplification process: before the pulse breaks up, the phase changes smoothly,
peaking at the position of the pulse maximum; after 2 dispersion lengths, though, the phase
is modulated with several local maxima. However, because there is no differential excitation
of the axes and no group velocity difference, the polarization remains constant.

Now consider the situation with different group velocities and $\theta = \pi/8$ in Figure 3,
which displays the two polarization intensity maxima walking off from one another. After two
dispersion lengths, the pulses have broken up, displaying multiple maxima, as in Fig. 1,
but the side peaks show a greater separation from the central peaks, which are still closely
correlated. The state of polarization of the pulse is strongly distorted as the pulse phase
is modulated and the polarizations walk off from one another; the phase difference is displayed in
Fig. 4 for two propagation distances. After one dispersion length (z = 1), the phases of the
two polarizations are closely correlated, but after two dispersion lengths (z = 2), the phase
differences are large; the central peaks are in phase, but the side peaks have large phase
differences. This affects the operation of the birefringent fiber amplifier devices in
erbium-doped fiber lasers. As a consequence, the operation of the lasers and the linear amplifier
devices will be degraded by these effects.

Not only is there a limit imposed on the pulse width and intensity, but the polarization
state of the pulses becomes distorted during propagation through the fibers. As the pulses walk off
from one another, the phase modulations at first remain locked, but the intensities are not so highly
correlated; therefore, the polarization state varies across the pulse. We illustrate this in
Figure 5, which shows the resultant pulse after passing through a polarizer that is either
parallel or perpendicular to the input pulse polarization; the intensity polarized parallel
to the input wave has structure, and the orthogonal polarization has a broad structured
intensity. The action of a birefringent amplifier and a polarizer can distort the pulse shape.
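
The polarizer projection behind Figure 5 can be sketched as follows. This is a minimal illustration assuming an ideal linear polarizer and real or complex field envelopes sampled on a common time grid; the function name `polarizer_outputs` and the sample grid are hypothetical.

```python
import numpy as np

def polarizer_outputs(ex, ey, theta):
    """Intensities passed by ideal polarizers parallel and perpendicular to the angle theta."""
    e_par = ex * np.cos(theta) + ey * np.sin(theta)
    e_perp = -ex * np.sin(theta) + ey * np.cos(theta)
    return np.abs(e_par) ** 2, np.abs(e_perp) ** 2

# The undistorted input pulse is fully transmitted by the parallel polarizer.
tau = np.linspace(-10.0, 10.0, 401)
theta = np.pi / 8
ex0 = np.cos(theta) / np.cosh(tau)
ey0 = np.sin(theta) / np.cosh(tau)
i_par, i_perp = polarizer_outputs(ex0, ey0, theta)
```

Any relative phase or amplitude change between the two components that varies across the pulse moves energy into the perpendicular channel, which is how a polarization state that varies across the pulse reshapes the transmitted pulse.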

IV.  SUMMARY  AND  FUTURE  WORK 

We made considerable progress toward a realistic simulation of a fiber laser. The amplifier
and fiber characteristics were carefully examined, including effects that will strongly
influence the operation of the laser. The fiber lasers can be developed as sources of ultrashort
optical pulses for gigabit optical communications networks.

During the program a publication was prepared and submitted to Photonics Technology
Letters. That paper details salient features of our simulations of a birefringent fiber amplifier.

Several future publications are planned and we will continue our collaboration; the fiber
laser simulations will incorporate gain saturation, as well as stimulated Raman scattering,
self-steepening, group velocity differences, and higher-order dispersion effects. Since we
have both polarizations, we can examine the evolution of the pulse's polarization in the
fiber, as well as study the pulse in the time and frequency domains. The simulations
here demonstrate a complicated evolution of the pulses, which ultimately affects the
operation of birefringent amplifier devices in a laser cavity.

FIGURES

FIG. 1. The intensity of the x- or y-polarization for two propagation distances, z = 1 (solid
line) and z = 2 (dashed line), for the case $\Delta = 0$ and $\theta = \pi/4$. The gain parameters are
$\Gamma = 2.3$ and $\mu = 0.2$.

FIG. 3. The intensity of the x- and y-polarized components for the case $\Delta = 0.5$ and $\theta = \pi/8$.
The propagation distance is z = 2. The gain parameters are identical to the previous case.

FIG. 4. Phase difference between the two polarizations for the intensity shown in Fig. 3.

FIG. 5. The intensity of the pulse in Figure 4 as it passes through a polarizer parallel to the
input polarization (solid line) and perpendicular to the input polarization (dashed line).

Abstract

Comparison  of  theory  and  experiment  is  critical  in  microelectronic  reliability. 
In  this  work,  a  model  which  has  been  previously  used  for  electromigration  time  to 
failure  predictions  (developed  by  Harrison)  is  extended  to  include  predictions  for 
numbers  of  voids  and  void  sizes.  Predictions  from  the  model  are  compared  to 
experimental  measurements.  The  model  makes  reasonable  predictions  for  mean 
lifetimes,  numbers  of  voids,  and  void  areas  with  very  few  free  parameters.  The 
model, however, does not adequately reproduce the range of lifetimes found in
experiment.

A MACROSCOPIC MODEL OF ELECTROMIGRATION:
COMPARISON WITH EXPERIMENT


Yolanda  J.  Kime 


I.  INTRODUCTION 

In  every  area  of  science  the  continuous  comparison  between  theory  and 
experiment  yields  the  growth  and  insight  which  drives  scientific  progress.  This  is 
no  less  true  in  microelectronic  reliability.  Much  work  has  been  done  to 
understand,  and  hence  to  control,  the  damage  and  the  ultimate  failure  of 
microelectronic  parts  due  to  electromigration  in  metal  interconnects.  From  a 
pragmatic  view,  the  characteristic  of  critical  importance  is  the  lifetime  of  the 
interconnect.  Most  of  the  theoretical  work,  and  much  of  the  experimental  work  as 
well,  has  focused  primarily  on  lifetime  measurements.  However,  other 
characteristics,  such  as  the  total  area  of  electromigration  voids  on  a  stripe,  can  be 
measured,  and  our  understanding  of  the  electromigration  process  can  only  be 
complete  when  these  other  characteristics  as  well  as  the  lifetimes  can  be  predicted 
accurately. 

The purpose of this work is to test the extent to which characteristics of thin film
electromigration  damage  such  as  void  areas  and  the  number  of  voids  can  be 
adequately  simulated  by  a  simple  model  [1]  which  has  previously  been  used  only 
to  examine  changes  in  the  time  to  failure.  The  experimental  results  which  are 
compared  to  the  simulation  results  are  discussed  in  detail  elsewhere  [2].  In  brief, 
Al-1% Si meander test stripes, 1600 μm long, 3.4 μm wide, and 0.8 μm thick, were
electromigrated to failure at a constant 2 x 10^ A/cm². The test stripes were
passivated with 0.4 μm of SiO2. The test stripes were examined for void damage
using  a  Scanning  Electron  Microscope  (SEM).  The  void  areas,  perimeters,  lengths, 
and  widths  were  measured  on  thirty-eight  of  the  stripes  using  a  Cambridge  SEM 
equipped  with  a  Robinson  backscatter  detector  in  conjunction  with  an  Optimas 
digital  image  analysis  system. 



II.  MODEL  DEVELOPMENT 


A.  MICROSTRUCTURE 

The microstructure and diffusion model adopted for this work is largely from
the work of Harrison [1]. In this model, the metal grain sizes are assumed to be
log-normally distributed, with the peak of the distribution at $D_{big}$, a full width at half
maximum of $W_{big}$, and a skew of $p_{big}$. If a bimodal distribution is used, the small
grain distribution is described by $D_{small}$, $W_{small}$, and $p_{small}$. The stripe is divided
up along its length into segments of width $D_{av}$, where $D_{av}$ is the average of $D_{big}$
and $D_{small}$. The grains are then placed into segments at random positions along
the stripe, continuing to add grains to each segment (as it is randomly chosen) until
the width of the stripe is reached (see Figure 1a). If a bimodal distribution is used,
it is assumed that the stripe consists of sections (a number of segments long) which
are characterized by the large grain distribution, interspersed with sections
which are characterized by the small grain distribution. The large grain sections
are randomly scattered along the stripe, with the small grain sections filling the
remaining length (see Figure 1b). This process determines the number of grain
boundaries within each segment.
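
The segment-filling procedure can be sketched as follows for the unimodal case. This is a minimal illustration, not Harrison's or the author's code: it parameterizes the log-normal distribution by a median and a log-scale width `sigma` rather than by peak, full width at half maximum, and skew, and the function name and numerical values are hypothetical.

```python
import numpy as np

def build_segments(stripe_length, stripe_width, d_median, sigma, rng):
    """Fill each segment across the stripe width with log-normal grains; return boundary counts."""
    n_segments = int(stripe_length // d_median)   # segments of width d_median along the stripe
    filled = np.zeros(n_segments)                 # width already occupied in each segment
    grains = np.zeros(n_segments, dtype=int)      # number of grains placed in each segment
    open_segments = list(range(n_segments))
    while open_segments:
        k = open_segments[rng.integers(len(open_segments))]   # choose a segment at random
        filled[k] += rng.lognormal(mean=np.log(d_median), sigma=sigma)
        grains[k] += 1
        if filled[k] >= stripe_width:
            open_segments.remove(k)
    # n grains stacked across the width are separated by n - 1 internal boundaries.
    return grains - 1

rng = np.random.default_rng(0)
boundaries = build_segments(1600.0, 3.4, 3.0, 0.3, rng)
```

The counts returned here cover only boundaries between grains; any additional boundaries assigned to the stripe edges would be added separately.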

The interface between the metal stripe and the passivation at the sides of the
stripes is also included as a grain boundary capable of passing material along the
stripe. This is a departure from Harrison's model. These interfaces are counted as
grain boundaries primarily because of the experimental observation that, for the
stripes I attempted to model, virtually all voids have an edge in common with this
interface [2]. Since this is true even for the smallest voids, it seems likely that
there is at least some material mobility along the interface.

Each grain boundary is assigned an angle $\varphi$ with respect to the long axis of the
stripe, and an angle $\theta$ which parameterizes the ion mobility along the grain
boundary [3]. These angles are assigned randomly, within limits, except for the
boundaries at the interfaces, for which $\varphi$ is zero. According to Attardo et al. [3],
the mobility is proportional to $\sin(\theta/2)$ where $0^\circ < \theta < 37^\circ$, and proportional to
$\sin(37^\circ/2)$ where $37^\circ < \theta < 60^\circ$.

It should be noted that, although the number of grains across the width of the
stripe is determined, the model is still essentially one-dimensional. The number of
grain boundaries, and $\theta$ and $\varphi$ for each boundary, are used to determine the flux
divergence (the difference between the fractional mass flow into the segment
boundary and the fractional mass flow out of the boundary), but the model does
not monitor where across the width of the stripe the material actually flows.

Explicitly, the flux divergence, using Harrison's notation [1], is

$$\mathrm{FLUXDIV} = \sum_i \sin(\theta_i/2)\cos\varphi_i - \sum_j \sin(\theta_j/2)\cos\varphi_j$$

where the summation on i is over the grain boundaries in the segment
to the left of the segment boundary and the summation on j is over the
grain boundaries in the segment to the right of the segment boundary, assuming mass flow
from left to right.
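
The flux divergence at a single segment boundary can be sketched as follows. This is a minimal illustration: each grain boundary is represented by a hypothetical `(theta_deg, phi_deg)` pair, and capping the mobility at 37 degrees inside the sum is an assumption that combines the Attardo mobility rule with the FLUXDIV expression.

```python
import math

def mobility(theta_deg):
    """Relative grain-boundary mobility: sin(theta/2) up to 37 degrees, constant above."""
    return math.sin(math.radians(min(theta_deg, 37.0)) / 2.0)

def flux_divergence(left, right):
    """FLUXDIV at a segment boundary; `left` and `right` are lists of (theta_deg, phi_deg)."""
    def total(boundaries):
        return sum(mobility(t) * math.cos(math.radians(p)) for t, p in boundaries)
    return total(left) - total(right)

# Two boundaries feed the segment boundary from the left, one carries material away.
example = flux_divergence([(30.0, 0.0), (20.0, 10.0)], [(30.0, 0.0)])
```

A positive value means more material arrives from the left than leaves to the right; identical boundary sets on both sides give zero divergence.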


B.  DIFFUSION 

The motion of material along the stripe, like the microstructure model, is
largely from Harrison's work [1]. The diffusion of atoms in electromigration is
assumed to obey the diffusion equation [4,5]

$$J_a = \left(\frac{ND}{kT}\right)(\mathrm{Force})$$

where

$J_a$ -- Atomic flux
$D$ -- Diffusion coefficient
$N$ -- Number density of metallic ions
$k$ -- Boltzmann's constant
$T$ -- Temperature

Following Harrison [1], let Force = -grad(U) and

$$U = Z^{*} q V + \mu_0 + kT \ln(N/N_0) + \Omega S_m$$

where

$Z^{*}$ -- Effective mobile ion charge number
$q$ -- Electronic charge
$V$ -- Electrical potential
$\mu_0$ -- Reference chemical potential
$N_0$ -- Equilibrium ion concentration
$\Omega$ -- Atomic volume
$S_m$ -- Mechanical (hydrostatic) stress

Combining these equations one gets

$$J_a = \left(\frac{ND}{kT}\right)\left[ Z^{*} q E - \frac{kT}{N}\,\mathrm{grad}(N) - \Omega\,\mathrm{grad}(S_m) \right]$$

where E is the local electric field strength.

It should be noted that there is no force term proportional to the local thermal
gradient, which may be significant for the stripes I am trying to simulate [6].
Thermal effects are taken into account in the temperature dependence of the
diffusion coefficient, the resistivity, and the thermal conductivity of the metal.
Explicitly, it is assumed that

$$D = D_0 \exp(-E_a / kT)$$

where

$$E_a = Q - V_0 S_h$$

and

$E_a$ -- Diffusion activation energy
$Q$ -- Zero hydrostatic stress activation energy
$V_0$ -- Activation volume
$S_h$ -- Hydrostatic stress.

Likewise,

$$\rho = \rho_0 [1 + \alpha (T - T_0)]$$

where

$\rho$ -- Resistivity
$\rho_0$ -- Resistivity at ambient temperature
$\alpha$ -- Temperature coefficient of resistivity
$T_0$ -- Ambient temperature
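
The two temperature dependences can be sketched as follows. This is a minimal illustration: it assumes an Arrhenius diffusion coefficient whose activation energy is lowered by the product of an activation volume and the hydrostatic stress, and a resistivity that is linear in temperature. The function names, the choice of electron-volt units, and all numerical values are hypothetical.

```python
import math

K_B = 8.617333e-5  # Boltzmann's constant in eV/K

def diffusion_coefficient(d0, q_ev, v_act, s_h, temp):
    """D = D0 exp(-Ea / kT) with Ea = Q - V * S (the product V * S expressed in eV)."""
    e_a = q_ev - v_act * s_h
    return d0 * math.exp(-e_a / (K_B * temp))

def resistivity(rho0, alpha, temp, temp0):
    """rho = rho0 [1 + alpha (T - T0)]."""
    return rho0 * (1.0 + alpha * (temp - temp0))

d_unstressed = diffusion_coefficient(1.0, 0.6, 1e-3, 0.0, 500.0)
d_stressed = diffusion_coefficient(1.0, 0.6, 1e-3, 10.0, 500.0)
rho_hot = resistivity(2.7e-8, 0.004, 400.0, 300.0)
```

With this sign convention a positive stress lowers the activation energy, so `d_stressed` exceeds `d_unstressed`.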

Following Harrison [7], three sources of hydrostatic stress $S_h$ are considered.
First,  there  may  be  residual  mechanical  stress  in  the  passivation.  Second,  there 
may  be  stresses  due  to  the  different  thermal  expansion  rates  of  the  various  layers. 
Third,  there  may  be  stresses  due  to  the  local  accumulation  of  metal. 

The first, residual mechanical stress, depends on the manufacturing process of
the stripes, and so must be included directly, rather than calculated. The stress due
to differential thermal expansion is assumed to have the form [7]

$$S_{xx} = S_{yy} = \frac{E(\alpha' + \nu\alpha'')\Delta T}{(1+\nu)(1-2\nu)}$$

$$S_{zz} = \frac{E[(1-\nu)\alpha'' + 2\nu\alpha']\Delta T}{(1+\nu)(1-2\nu)}$$

where

$$\alpha' = \alpha_{substrate} - \alpha_{metal}$$

$$\alpha'' = \alpha_{passivation} - \alpha_{metal}$$

and

$E$ -- Young's modulus of the metal
$\alpha$ -- Thermal expansion coefficient
$\nu$ -- Poisson's ratio for the metal

Harrison [7] points out that if $\alpha'' \approx \alpha'$ then $S_{xx} = S_{yy} = S_{zz}$ and the stress is

approximately  hydrostatic.  However,  here  I  calculate  stress  due  to  differential 
thermal  expansion  from  5),  =  ^yy  +  following  Harrison. 
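
The thermal stress components can be sketched as follows. This is a minimal illustration that assumes the components follow from isotropic Hooke's law with in-plane strains set by the substrate mismatch and the out-of-plane strain set by the passivation mismatch; the function name and the material values in the example are hypothetical.

```python
def thermal_stress(young, poisson, alpha_sub, alpha_pass, alpha_metal, delta_t):
    """Return (Sxx, Syy, Szz) produced by differential thermal expansion."""
    a1 = alpha_sub - alpha_metal     # mismatch with the substrate
    a2 = alpha_pass - alpha_metal    # mismatch with the passivation
    pref = young * delta_t / ((1.0 + poisson) * (1.0 - 2.0 * poisson))
    sxx = pref * (a1 + poisson * a2)
    szz = pref * ((1.0 - poisson) * a2 + 2.0 * poisson * a1)
    return sxx, sxx, szz

# Equal mismatches give equal components, i.e. a hydrostatic state.
sxx, syy, szz = thermal_stress(70e9, 0.35, 3e-6, 3e-6, 23e-6, -200.0)
```

The example reproduces the limiting case noted in the text: when the two mismatches are equal, all three components coincide.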

The stress due to mass accumulation is assumed [7] to have the form

$$S_{mass\ accumulation} = K\left(\frac{\Delta V}{V_e}\right)$$

where K is the bulk modulus for the metal and $V_e$ is the equilibrium volume.


For ease of computation, it is not the mass flow directly which is calculated at
each step, but rather the fractional mass change (the porosity change) which is
determined. The change in the porosity P with time is defined as

$$\frac{dP}{dt} = -\frac{1}{N}\frac{dN}{dt}$$

The negative sign assures that an increase in porosity corresponds to a loss of material
and a decrease in porosity corresponds to a gain of material.

This can be related to the diffusion equation through the continuity equation

$$\frac{dN}{dt} = -\mathrm{div}(J_a)$$

Combining, one gets

$$\frac{dP}{dt} = -\frac{1}{N}\,\mathrm{div}\left[\left(\frac{ND}{kT}\right)\mathrm{grad}(U)\right]$$

The temperature at each time step is determined locally by assuming a steady
state. The energy created by Joule heating is equal to the energy dissipated by
thermal conduction. Thus, the temperature at the kth segment is determined by [1]


K 


m 


D 


[t; 


+  ■ 


av 

K 


OX 


KxK 


+ 


where 

$J_e$ -- Local current density
$K_m$ -- Metal thermal conductivity
$K_{ox}$ -- Oxide thermal conductivity
$h_m$ -- Metal thickness
$h_{ox}$ -- Oxide thickness
$T_k$ -- Temperature at segment k
$T_0$ -- Ambient temperature


The metal thermal conductivity is assumed to follow the Wiedemann-Franz
law. That is, $K_m = LT/\rho$, where L is the Lorenz number, T is the local temperature,
and $\rho$ is the resistivity. For aluminum I have used $2.45 \times 10^{-8}$ W$\Omega$/deg$^2$ for L.
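
The Wiedemann-Franz relation can be evaluated as in the following minimal sketch. The exponent of the Lorenz number is taken here as -8, the usual order of magnitude in SI units, and the resistivity in the example is a hypothetical room-temperature value.

```python
LORENZ = 2.45e-8  # Lorenz number in W Ohm / K^2 (exponent assumed, see lead-in)

def thermal_conductivity(temp, rho):
    """Wiedemann-Franz law: thermal conductivity = L * T / rho, in W/(m K) for SI inputs."""
    return LORENZ * temp / rho

k_metal = thermal_conductivity(300.0, 2.7e-8)
```

Because the resistivity itself rises with temperature, the conductivity obtained this way varies only weakly with temperature.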


For determining times to failure, it is sufficient to determine the porosity for
each time step; the easiest failure criterion to apply is simply that the stripe has
failed when the porosity of any segment becomes 1.0. However, since one of the
objectives of this work is to predict the distribution of void areas (which was not
considered in Harrison's work), some method of turning porosities into void areas
must be chosen. This has been done in two steps. First, a segment is not
considered to have a void until the porosity has reached some critical porosity.
The critical porosity thus becomes a free parameter. Second, once the critical
porosity has been reached, any additional porosity gain is correlated to a void area
by

$$\frac{\Delta P}{\Delta t} = \frac{1}{V_s}\frac{\Delta V}{\Delta t} = \frac{1}{A_s}\frac{\Delta A}{\Delta t}$$

where $A_s$ is the segment area and $\Delta A$ is the increase in the void area.
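
The two-step conversion from porosity to void area, together with the failure criterion, can be sketched as follows. This is a minimal illustration of one time step for one segment; the function name, the return convention, and the segment area used in the example are hypothetical.

```python
def update_segment(porosity, dp, segment_area, critical_porosity):
    """Advance one segment by a porosity increment dp; return (porosity, void_area_gain, failed)."""
    new_porosity = min(porosity + dp, 1.0)
    # Only porosity gained above the critical value is converted into void area.
    excess_before = max(porosity - critical_porosity, 0.0)
    excess_after = max(new_porosity - critical_porosity, 0.0)
    void_area_gain = segment_area * (excess_after - excess_before)
    return new_porosity, void_area_gain, new_porosity >= 1.0

# A segment of area 10.2 crossing the critical porosity of 0.30 during this step.
p_new, area_gain, failed = update_segment(0.25, 0.10, 10.2, 0.30)
```

In the example the porosity rises from 0.25 to 0.35, but only the 0.05 above the critical porosity contributes to void area.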

III.  RESULTS 

In any computer simulation it is important to verify that the results obtained
are the result of the model being tested or explored, and are not artifacts of the
way that the program makes the calculation. Results can depend dramatically, in
particular, on the size of the time increment used in the simulation. For this model,
I found that the mean times to failure varied sharply depending on the time
increment (see figure 2a). The mean time to failure varies between 234 hrs at a 10
second time increment and 378 hrs at a time increment of 5 x 10^ s (for a median
grain size of 3.0 μm and a residual stress of 100 MPa). Experimentally the mean
time to failure for the stripes modeled is 200 ± 20 hrs.

Figure 2 (panels a and b; horizontal axis: Time Increment (s))

The relationship between mean time to failure and the time increment used,
however, is not strictly monotonic, especially at small time increments, as seen in
figure 2b. This relationship may be due to two competing factors: at large
increments information is lost because one is essentially approximating the
electromigration as linear in time, which it isn't. At small time increments,
however, information may be lost because changes are smaller than the precision
of the computer allows.
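
The precision effect suggested above can be illustrated with a minimal sketch. It shows only the mechanism in double-precision arithmetic; whether it occurs in the simulation depends on the precision and the magnitudes actually used, which are not stated, and the increments below are hypothetical.

```python
def accumulate(total, increment, steps):
    """Add `increment` to `total` repeatedly, as an explicit time-stepping loop would."""
    for _ in range(steps):
        total += increment
    return total

# An increment below half the spacing of doubles near 1.0 is rounded away at every step.
lost = accumulate(1.0, 1.0e-17, 1000)
# The same total change applied in one larger step is retained.
kept = accumulate(1.0, 1.0e-14, 1)
```

Here `lost` is still exactly 1.0 after a thousand steps, while `kept` exceeds 1.0, so the result depends on how the same total change is divided into steps.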

The number of voids on the stripe varies in a similar way with changing time
increment, as shown in figure 3 for a critical porosity of 0.30.

Reassuringly, the model predicts increasing lifetimes when the median grain
size is increased, as shown in figure 4. This trend is well established
experimentally by other authors. With a mean time to failure of 200 ± 20 hrs and a
median grain size between 2.5 and 3.5 μm, our experimental results lie close to the
predicted values. Figure 5 shows that the number of voids (and also the total void
area) decreased with increased grain size. The results plotted here are for a critical
porosity of 0.30 at a time increment of 1 x 10^ s.

It is interesting, also, to see how the plot of void area on the stripe as a
function of time to failure changes with increasing grain size. These changes are
shown in figure 6 (for a critical porosity of 0.30 and a time increment of 10^ s). At
larger grain sizes, the void area on the stripe does not grow as quickly with
increasing time to failure. The range of times to failure also increases with greater
median grain size.

Figure 3

[Two plots; horizontal axis: Median Grain Size (microns)]

It is important to note, though, that even at the largest median grain size used
the slope of the void area versus time to failure is much steeper than the slope
found experimentally, as shown in figure 6b. Although the range of the void areas
is similar in experiment and simulation, the range of the time to failure is much
narrower in the simulation. This general result was true for all of the
simulations run; the model simply does not predict as large a variation in time
to failure as is observed experimentally.

Figure 6

I also explored the changes in time to failure and number of voids with
changes in the residual stress in the metallization, as shown in figure 7 (for a
critical porosity of 0.45 and a time increment of 1 x 10^ s). Not unexpectedly, the
mean time to failure increases as the stress goes from tensile to compressive.
Correspondingly, the total void area and the mean number of voids decrease.
With a residual stress of 200 MPa on the experimental stripes, a mean void area
per stripe of 19 ± 2 μm², and a mean number of voids of 28 ± 2, the simulations
appear to give results very different from the experimental ones. In this case, however,
I think the difference is due more to the choice of critical porosity and time
increment for these runs than to a failure of the model itself.

As with increasing median grain size, increasing residual stress somewhat
increases the range of the times to failure predicted and decreases the slope of the
number of voids on the stripe versus the time to failure, as shown in figure 8.
However, as before, the range of the times to failure of the experimental data is
still much larger than that of the simulations, as shown in figure 8b.

One addition to the model which may make the microstructure more realistic
is the possibility of flaws (in this case porosity) in the stripe before
electromigration ever takes place. Here I have explored the effects both of
changing the size of the flaws and of changing the number of flaws. When flaws
are included in the microstructure, both the number of flaws on the stripe and the
size of each flaw are chosen randomly, each up to a given limit. So changing the
size of the flaws actually means changing the maximum size of the flaws, and
likewise for the number of flaws. I found that for a maximum flaw size (porosity)
of 0.05, increasing the maximum number of flaws on each stripe from 5 to one
thousand resulted in only minimal changes in the mean time to failure (on the
order of 5%). Changing the maximum size of the flaws had a larger influence on
the times to failure.

Figure 7

Figure 8

It  is  important  to  note  that  even  a  flaw  of  0.05  (ignoring  critical  porosity) 
corresponds  to  a  void  10  times  larger  than  the  resolution  limit  of  our  experimental 
measurements.  Furthermore,  on  all  the  38  stripes  measured,  fewer  than  a  dozen 
stress  voids  were  observed  on  the  guard  stripes  paralleling  the  electromigrated 
stripes.  So  even  starting  a  simulation  with  10  flaws  of  0.05  size  is  probably  more 
flawed  than  the  experimental  stripes  were. 

The mean time to failure and the total void area as functions of the maximum
flaw size are shown in figure 9. These simulations were run with a critical porosity
of 0.3, a time increment of 1 x 10^ s, and a maximum of 100 flaws per stripe. Not
surprisingly, the mean time to failure drops with increasing flaw size. And,
primarily because the stripes fail sooner, the total void area on the stripe also
decreases with increasing flaw size.

Figure 9 (two panels; horizontal axis: Maximum Flaw Size)

Whereas median grain size, residual stress, and maximum flaw size may be
assessed experimentally, the critical porosity is truly a free parameter. Both the
experimental value for mean number of voids per stripe (28 ± 2) and the mean
void area per stripe (19 ± 2 μ²) suggest a critical porosity between 0.35 and 0.40
(see figure 10). However, comparing the experimental distribution of void areas
with simulations at different critical porosities (see figure 11) suggests a critical
porosity between 0.25 and 0.30. It is actually rather surprising that the
distribution of void areas (here normalized to the number of stripes simulated)
can be fit as closely with only one truly free parameter. Changing the critical
porosity does not affect the mean time to failure, but it does affect the slope of the
number of voids per stripe versus the time to failure, as seen in figure 12a. The
slope decreases with increasing critical porosity. Nonetheless, as with the
median grain size and residual stress, all of the slopes are steeper than the
experimental results, as seen in figure 12b. These simulations were done with a
median grain size of 3.0 μ and a time increment of 1 x 10^ s.

Figure 10

Figure 11

Figure 12
IV. CONCLUSIONS


Although  the  agreement  between  the  model  predictions  and  the  experimental 
results  is  by  no  means  exact,  the  model  does  predict  reasonable  times  to  failure, 
numbers  of  voids,  and  void  area  distributions  with  very  few  free  parameters.  The 
model  correctly  predicts  some  well  documented  trends,  such  as  increasing 
lifetimes  with  increasing  grain  sizes.  The  model  does  not  adequately  predict  the 
range  of  times  to  failure.  The  range  of  times  to  failure  from  the  simulations  is 
consistently  five  to  ten  times  smaller  than  the  experimental  range.  A  more 
sophisticated  model  for  the  microstructure  may  help  provide  more  variability  in 
the  times  to  failure. 
SEMICONDUCTOR CYLINDER FIBERS FOR FIBER LIGHT AMPLIFIER
APPLICATIONS

Philipp Kornreich
Professor

DEPARTMENT OF ELECTRICAL AND COMPUTER ENGINEERING
Syracuse University

ABSTRACT.

We have measured the transmission spectra of two types of Semiconductor Cylinder Fibers
(SCyFs) at the Photonics Center of Rome Laboratories. These devices were fabricated by
Syracuse  University.  The  fiber  preforms  and  SCyFs  with  CdTe  and  CdS  semiconductor  cylinders 
were  tested.  We  have  shown  that  the  semiconductors  survive  the  fabrication  process.  A  number 
of  fibers  of  each  kind  were  tested  with  reproducible  characteristics. 

INTRODUCTION 

We have tested the transmission spectra of Semiconductor Cylinder Fibers (SCyFs) that
were fabricated at Syracuse University. These results will be submitted for review for
publication in “Photonics Letters”. Except for a modified introduction, this report contains the
text of the submitted publication. The SCyFs consist of a glass core surrounded by a thin
semiconductor cylinder. The semiconductor cylinder, in turn, is surrounded by a glass cladding
(see Fig. 1). A number of each type of fiber were fabricated by Syracuse University and tested at
the Photonics Center of Rome Laboratories. To date we have used two semiconductors, CdTe and
CdS. Spectral measurements at the Photonics Center of Rome Laboratories showed that both
semiconductors survived the fiber fabrication process. The fibers had reproducible characteristics.

Semiconductor microcrystals (SMs) in glass are a predecessor of this technology. SMs
have a large number of surface states at their interfaces with the host glass. These interface
states are responsible for nonradiative recombination in SMs. There are techniques available
to substantially reduce the nonradiative recombination in SCyFs.


Fig.  1.  Typical  SCyF  construction.  The  device  consists  of  a  glass  core  surrounded 
by  a  semiconductor  cylinder.  The  semiconductor  cylinder,  in  turn,  is  surrounded 
by  a  glass  cladding. 
One application of these devices is as a Semiconductor Cylinder Fiber Light Amplifier
(SCyFLA) for fiber optic communication. Another application is as nonlinear devices. An
application that is probably much farther in the future is to develop fiber bundle plates consisting
of SCyFLAs that operate in the visible spectrum and are electrically pumped. These devices could
be used as true eyeglass light amplifiers. The Micro Channel Plates that are used in current
night vision devices are not true light amplifiers: the viewer observes a fluorescent screen
rather than the actual amplified image.

A competing technology is the Rare Earth doped glass Fiber Light Amplifier (REFLA). Since
a great variety of semiconductors can be used, SCyFLAs can be tailored to work over
a large range of wavelengths. Unlike rare earth doped FLAs, SCyFLAs are broadband and thus
capable of amplifying broadband Wavelength Division Multiplexed light signals.

SCyF FABRICATION


The SCyFs were fabricated by first vacuum depositing a film of a direct band gap semiconductor,
CdTe or CdS, on a rotating Pyrex glass rod in a diffusion pumped vacuum system. The vacuum
deposited semiconductor film was probably of poor quality since the glass rods could not be
heated during deposition. A Pyrex glass was selected that has a softening temperature of 650 °C.
This is lower than the melting points of CdTe (935 °C) and CdS (1400 °C).

Next, the semiconductor coated glass rod was inserted into a Pyrex glass tube that had been
closed at one end. This structure was evacuated and collapsed. N2 was used as the residual gas in the
evacuation process. Finally, a fiber was pulled from the resulting preform.

EXPERIMENTAL RESULTS


Several fibers with approximately 50 Å thick CdTe semiconductor cylinders and CdS
semiconductor cylinders have been tested. These fibers had approximately 70 μm diameter
Pyrex glass cores and 140 μm diameter Pyrex glass claddings. Material availability restricted
us to using glass with the same index of refraction for both the core and cladding. Nevertheless,
our tests indicated that light was guided in the core by the semiconductor cylinder.

The CdS films were a yellowish gray when first deposited on the core glass rods of the preform,
indicating that they were Cd rich. However, when the cladding tube was collapsed onto the
CdS cylinder at about 650 °C they turned bright yellow. Indeed, a spectrum analysis of the CdS
preform showed that it had a sharp absorption edge at 514 nm, exactly as predicted.
The transmission spectra of CdTe fiber sections about 20 mm in length and CdS fiber
sections about 150 mm in length have been measured. Samples from several fibers were used,
with identical results. Measurements were performed at the Photonics Center of Rome
Laboratories using an Ando type AQ 1425 spectrum analyzer connected to an Apple “Power
Book” computer. The computer used “Lab. Window” software both to control the spectrum
analyzer and to record the data. The CdTe fibers exhibit an absorption edge at 830 nm (see Fig. 2).
This corresponds to the absorption edge of bulk CdTe. The transmission spectrum of one of the
preforms was also measured. It too exhibited an absorption edge at 830 nm. A small quantum
size effect was predicted for the fibers. That is, we expected the fibers to have an absorption
edge at a wavelength somewhat shorter than the absorption edge wavelength of bulk CdTe in the
preforms. This was not observed. We suspect that surface states counteracted the quantum size
effect. There is also considerable strain in the semiconductor film.

Fig. 2. Typical transmission spectrum of CdTe cylinder fiber with white light
illumination. The spectrum was normalized using a Pyrex fiber with no semiconductor
cylinder. A resolution of 10 nm was used.

The CdS fibers exhibit a steep absorption edge at 514 nm (see Fig. 3). This corresponds
to the absorption edge of bulk CdS. The absorption spectrum of one of the preforms was also
measured. It too exhibited an absorption edge at 514 nm. Again, a small quantum size effect was
predicted for the fibers. This, too, was not observed. All measurements were performed at
room temperature.
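
As a quick check on the measured edges, an absorption edge wavelength can be converted to the corresponding photon energy with E = hc/λ. The sketch below is illustrative only: the helper name `edge_energy_ev` is hypothetical, and the constant hc ≈ 1239.84 eV·nm is a rounded standard value rather than a number taken from this report.

```python
# Planck constant times speed of light in eV*nm (rounded standard value).
HC_EV_NM = 1239.84


def edge_energy_ev(wavelength_nm):
    """Photon energy in eV at a given absorption edge wavelength in nm."""
    return HC_EV_NM / wavelength_nm


# Absorption edges reported in the text for the two fiber types.
edges_nm = {"CdTe": 830.0, "CdS": 514.0}
energies_ev = {name: edge_energy_ev(nm) for name, nm in edges_nm.items()}

for name, energy in energies_ev.items():
    print(f"{name}: {edges_nm[name]:.0f} nm -> {energy:.3f} eV")
```

The resulting energies, about 1.49 eV for CdTe and about 2.41 eV for CdS, are the photon energies at the reported edges. A quantum size effect would appear as an edge at higher energy, that is, at a shorter wavelength.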


Fig.  3.  Typical  transmission  spectrum  of  CdS  cylinder  fiber.  A  resolution  of  10 
nm  was  used  to  obtain  this  data. 


From the above data we conclude that the semiconductors indeed survive the fiber
fabricating process.

MECHANISM OF CHARGE CARRIER LIGHT INTERACTION

One can illustrate the interaction of the charge carriers in the semiconductor cylinder with
the light electromagnetic field near the core cladding interface as follows: The transverse
electric field of the light will induce a current with current density J in the thin semiconductor
film (see Fig. 4). The current depends on the charge carrier density in the semiconductor. This
current density induces a discontinuity of the longitudinal magnetic flux density B at the core
cladding interface. This requires that there be a mode propagating in the fiber with an electric
field that does not go to zero at the core cladding interface.

Fig. 4. The transverse light electric field induces a current with current density
J in the thin semiconductor film. The current depends on the charge carrier
density. The current density induces a discontinuity of the longitudinal magnetic
flux density B at the core cladding interface.

The interaction between the light and the charge carriers in the semiconductor cylinder can
be illustrated by an analysis of the propagation of the $TE_{10}$ mode in a SCyF. Rather than
performing the calculations in terms of the conductivity at light frequencies of the semiconductor
cylinder we use the imaginary part of the dielectric constant $\epsilon_i$ which is related to this
conductivity. The dispersion relation of the $TE_{10}$ mode of a step index fiber with a
semiconductor cylinder having an imaginary (lossy) part of the dielectric constant can be obtained
in a way similar to the derivation of the dispersion relation of the $TE_{10}$ mode of a
conventional step index fiber.

$$\frac{u J_0(ub)}{J_1(ub)} + j\,\frac{4\pi^2 \epsilon_i a}{\lambda^2} + \frac{w K_0(wb)}{K_1(wb)} = 0 \qquad (1)$$

where the solutions of the wave equations in the core and cladding yield:
$$\text{a)}\quad k^2 + u^2 = \frac{4\pi^2 n_1^2}{\lambda^2} \qquad \text{and} \qquad \text{b)}\quad k^2 - w^2 = \frac{4\pi^2 n_2^2}{\lambda^2} \qquad (2)$$

$k$ is the complex propagation constant of the $TE_{10}$ mode, $b$ is the core radius, $a$ is the
semiconductor cylinder thickness, and $n_1$ and $n_2$ are the indices of refraction of the core and
cladding respectively.

By solving the dispersion relations for the real part $\beta$ of the propagation constant $k$ we
obtain a result identical to the dispersion relation of the $TE_{10}$ mode of a simple step index
fiber as it appears in the literature.

$$\frac{u_0 J_0(u_0 b)}{J_1(u_0 b)} + \frac{w_0 K_0(w_0 b)}{K_1(w_0 b)} = 0 \qquad (3)$$

where $u_0$ and $w_0$ are the real parts of the transverse propagation constants $u$ and $w$. The
imaginary part $\alpha$ of the propagation constant $k$ can be obtained by expanding equation 1 to
first order in small quantities and making use of equations 2a, 2b, and 3. Here $\alpha$ is the gain
coefficient.

(4) [expression for the gain coefficient $\alpha$; only the leading factor $2\pi^2 \epsilon_i a$ is legible in the source]

Note that $\beta$ and $u_0$ are wavelength dependent. Since both $u_0$ and $w_0$ must be real we
observe from equations 2 that the real part of the propagation constant $\beta$ must have values
between two narrow limits:

$$\frac{4\pi^2 n_2^2}{\lambda^2} \le \beta^2 \le \frac{4\pi^2 n_1^2}{\lambda^2} \qquad (5)$$

In order for the gain coefficient to be nonzero the first order Bessel function $J_1(u_0 b)$ must be
nonzero at the core radius $b$ for this mode.
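
The first-order expansion mentioned above can be sketched as follows. This is a hedged outline, not the author's derivation: it assumes the sign convention $k = \beta + j\alpha$ with $|\alpha| \ll \beta$, assumes the loss term in the dispersion relation is $4\pi^2 \epsilon_i a/\lambda^2$, and introduces the shorthand $F$, $G$, and $C$ purely for illustration. It is not claimed to reproduce the closed form of the gain coefficient given in the original paper.

```latex
% Sketch under the stated assumptions; F, G and C are illustrative shorthand.
\begin{align*}
F(u) &\equiv \frac{u\,J_0(ub)}{J_1(ub)}, \qquad
G(w) \equiv \frac{w\,K_0(wb)}{K_1(wb)}, \qquad
C \equiv \frac{4\pi^2 \epsilon_i a}{\lambda^2} \\[4pt]
% Zeroth order recovers the lossless step index relation.
F(u_0) + G(w_0) &= 0 \\[4pt]
% k^2 ~ beta^2 + 2 j alpha beta shifts u and w through the two wave equation relations.
k^2 \approx \beta^2 + 2j\alpha\beta
\;&\Rightarrow\;
\delta u = -\frac{j\alpha\beta}{u_0}, \qquad
\delta w = +\frac{j\alpha\beta}{w_0} \\[4pt]
% First order terms of the dispersion relation.
0 &= F'(u_0)\,\delta u + G'(w_0)\,\delta w + jC \\[4pt]
\alpha &= \frac{C}{\beta\left[\dfrac{F'(u_0)}{u_0} - \dfrac{G'(w_0)}{w_0}\right]}
\end{align*}
```

Under these assumptions $\alpha$ scales linearly with both $\epsilon_i$ and the cylinder thickness $a$. Writing out $F'(u_0)$ places $J_1(u_0 b)$ in the denominator of the bracket, which is consistent with the requirement that $J_1(u_0 b)$ be nonzero for a nonzero gain coefficient.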
The result obtained in equation 4 cannot be directly compared to the experimental results
since the fibers used in the experiments had 70 μm diameter cores through which a multitude of
modes propagated.
SELF-PULSATION AND OPTOELECTRONIC FEEDBACK-SUSTAINED
PULSATION  OF  LASER  DIODES  AT  1300  NM 


Guifang  Li 
Assistant  Professor 
Department  of  Electrical  Engineering 


Abstract 


Transient self-pulsation has a lifetime of a few minutes with frequencies up to 7 GHz. The linewidth of
self-pulsation is on the order of 0.5 GHz. With optoelectronic feedback, the transient self-pulsation can be
stabilized and enhanced. The center frequency of feedback-sustained pulsation is dependent on the
passband of the bandpass filter in the feedback loop. The linewidth of feedback-sustained pulsation is
significantly reduced to about 20 kHz. The optical spectra of the laser diodes exhibit coherence collapse
at weak optoelectronic feedback. The feedback-sustained pulsation can be frequency modulated.
Applications of the feedback-sustained pulsation include subcarrier multiplexing optical networks.

Self-sustained pulsation (SSP) is a well-known dynamic process in laser diodes [1]. It has been
extensively studied both experimentally and theoretically [2,3,4]. The majority of these previous
experiments have been conducted in 780 nm laser diodes. SSP was first considered to be a nuisance and
there  were  efforts  aimed  at  reducing  or  eliminating  SSP.  It  was  realized  later  on  that  SSP  laser  diodes 
have  low  relative  intensity  noise  (RIN)  because  of  short  coherence  length  which  reduced  the  effects  of 
optical  feedback  into  the  laser  diodes.  SSP  laser  diodes  have  also  been  proposed  for  use  in  all-optical 
clock  recovery  [5].  Recently,  a  promising  new  application  has  emerged  in  which  SSP  would  be  used  as 
subcarriers  for  lightwave  communication  networks  [6].  Both  analog  (up  to  1  GHz)  [7]  and  digital  (up  to 
800  Mb/s)  [8,9]  transmission  using  SSP  subcarriers  have  been  reported.  Subcarrier-multiplexing  (SCM) 
not  only  is  compatible  with  wavelength-division  multiplexing  (WDM)  but  also  can  extend  the  capacity  of 
WDM  networks  [10].  As  a  result,  WDM-SCM  networks  are  of  particular  interest  for  local  area  networks 
(LAN)  where  pure  WDM  may  not  be  able  to  provide  a  large  number  of  nodes.  It  is  therefore  important  to 
generate  microwave  subcarriers  in  laser  diodes  at  wavelengths  other  than  780  nm.  This  letter  describes 
SSP,  transient  self-pulsation  (TSP),  and  the  generation  and  frequency  modulation  of  feedback-sustained 
pulsations  (FSP)  in  laser  diodes  at  1300  nm. 


Fig.  1  Schematic  of  the  experimental  setup  for  studying  SSP,  TSP  and  FSP. 
The schematic of the experimental setup is shown in Fig. 1. The pulsations were observed in
Mitsubishi Electric ML7011R InGaAsP single-mode (pigtailed) laser diodes (LD). The emission
wavelength of these laser diodes is near 1300 nm with typical cw output powers of 5 mW. The laser
diode is temperature stabilized to 20 °C using a thermoelectric cooler (TEC). The pigtailed fiber from
the laser diode is fusion spliced to a 3 dB bi-directional fiber coupler. Half of the laser output was used for
monitoring the optical spectrum using an Anritsu MS9001B1 optical spectrum analyzer (OSA). The other
half of the laser output was fed back into the laser diode through an optoelectronic feedback
loop. The first component in the optoelectronic feedback loop is a photodetector (PD) which converts
laser light pulsations into microwave oscillations. The microwave oscillation signal is pre-amplified using
a Miteq 18-GHz wideband low-noise amplifier. The amplified signal is split into two parts using an rf
power splitter. Half of the rf power was used for monitoring the spectrum of the microwave oscillations
using an HP 8593E microwave spectrum analyzer. The second half of the pre-amplified signal was fed into
the laser diode through the ac port of the bias tee via a bandpass filter (BPF), another wideband amplifier
and a microwave power combiner (COM). The other input port of the COM was used for modulating
signals.
Figure 2 Microwave spectra of (a) SSP at a bias current of 20 mA, (b) TSP at a bias
current  of  21  mA  immediately  after  turn  on,  and  (c)  same  as  (b)  35  seconds  after  turn  on. 


Self-pulsations in the laser diodes (with the feedback turned off) fall into two categories,
SSP and TSP. The laser intensity pulsations observed in this experiment are predominantly transient in
nature. SSP only occurs within a very small range of bias current. The frequency range of the
self-pulsation is between 2.0 GHz and 7.0 GHz. All laser diodes we have examined showed qualitatively
similar pulsation behavior. For example, a particular laser diode exhibited self-pulsation for bias currents
between 17.0 and 32.0 mA with pulsation frequencies ranging from 3.5 GHz to 6.5 GHz. However,
self-sustained pulsation only occurs for bias current values near 20 mA. The microwave spectrum of the SSP
at a bias current of 20 mA is shown in Fig. 2a. Self-pulsations at other bias currents were transient:
the intensity pulsation occurred as soon as the laser diode was turned on or turned to a different
bias current. Transient self-pulsations typically lasted only 1 to 3 minutes. Fig. 2b shows the
microwave spectrum of the TSP immediately after the laser diode was turned on with a bias current of
21.0 mA. The linewidth of the self-pulsation is very broad, on the order of 0.5 GHz. The
microwave spectrum 35 seconds after the laser was turned on is shown in Fig. 2c. The magnitude of the
self-pulsation decreased significantly in 35 seconds. The TSP disappeared in about 45 seconds.

Optoelectronic feedback has been used in previous work to reduce the linewidth of SSP [6,7]. In
this experiment, optoelectronic feedback was used to stabilize the TSP. We found that optoelectronic
feedback indeed stabilized the pulsations. Furthermore, the linewidth of FSP was also significantly
reduced compared to that of the TSP. However, the center frequency of the FSP (nearly identical to that
of the BPF) was mainly determined by the passband of the bandpass filter. The frequency of FSP can
therefore be tuned by the BPF. In addition, the frequency of FSP with a fixed BPF can be fine tuned by
laser diode bias current. This fine-tune range was about 45 MHz. Along with the generation of FSP with
optoelectronic feedback were changes in the optical spectrum of the laser light.

Figure 3 Optical spectra of laser output for (a) free running condition, (b) weak
optoelectronic  feedback,  and  (c)  strong  optoelectronic  feedback. 


We first present the optical spectra of the laser output with and without feedback. The laser
diodes used in this experiment are single mode. The spectrum of a free running laser diode in the TSP
regime is shown in Fig. 3a. There existed occasional mode hopping. In this particular spectrum the two
side modes were about 7 dB down from the main mode. Almost all the optical power was contained in a
wavelength span of about 2.0 nm. Optoelectronic feedback can significantly affect the optical spectrum of
the laser diode output. Figure 3b is the optical spectrum of the laser output when the feedback was
slightly above a threshold value for stable FSP. Now the wavelength span is about 15 nm. This is very
similar to the coherence collapse observed in laser diodes with external optical feedback [11]. As the
feedback was increased beyond a critical value, the optical spectrum, shown in Fig. 3c, switched into a
form that more closely resembled that of the free running laser diode except for a few additional very weak
side modes. It is worth mentioning that "strong" and "weak" feedback here are only qualitatively defined since
the bias tee is not broadly matched to the laser diode and, as a result, it was not possible to easily estimate
the extent of reflection of the feedback signal.

Now  we  present  the  electrical  characteristics  of  the  FSP.  As  mentioned  earlier,  the  frequency  of 
the  FSP  is  predominantly  determined  by  the  passband  of  the  bandpass  filter.  With  a  BPF  of  center 
frequency  4.68  GHz  and  10%  passband,  the  microwave  spectrum  of  the  FSP  is  shown  in  Fig.  4a.  The 
fundamental  frequency  of  the  FSP  is  4.70  GHz.  High-order  harmonics  are  also  present  in  the  FSP  signal 
but  are  of  lower  amplitudes.  The  second  harmonic  is  20  dB  below  the  fundamental  harmonic.  The 
linewidth  of  FSP  is  very  narrow  as  can  be  seen  in  Fig.  4b  which  is  a  close-up  look  of  the  fundamental  peak 
in  the  FSP  signal. 
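
The filter and harmonic figures quoted above can be checked with a short calculation. This is a sketch under two stated assumptions: that "10% passband" means a total bandwidth equal to 10% of the center frequency, and that the 20 dB figure is a power ratio. The function names are hypothetical.

```python
def passband_edges_ghz(center_ghz, fractional_bandwidth):
    """Lower and upper passband edges, assuming the fractional bandwidth
    is the total width divided by the center frequency."""
    half_width = 0.5 * fractional_bandwidth * center_ghz
    return center_ghz - half_width, center_ghz + half_width


def db_to_power_ratio(level_db):
    """Convert a relative level in dB to a linear power ratio."""
    return 10.0 ** (level_db / 10.0)


low_ghz, high_ghz = passband_edges_ghz(4.68, 0.10)
fundamental_ghz = 4.70
second_harmonic_ghz = 2.0 * fundamental_ghz

fundamental_in_band = low_ghz <= fundamental_ghz <= high_ghz
harmonic_in_band = low_ghz <= second_harmonic_ghz <= high_ghz
harmonic_power_ratio = db_to_power_ratio(-20.0)

print(f"passband: {low_ghz:.3f} to {high_ghz:.3f} GHz")
print(f"fundamental in band: {fundamental_in_band}")
print(f"second harmonic in band: {harmonic_in_band}")
print(f"second harmonic relative power: {harmonic_power_ratio:.2f}")
```

Under these assumptions the 4.70 GHz fundamental lies inside the filter passband while the 9.40 GHz second harmonic lies outside it, and a level 20 dB below the fundamental corresponds to one hundredth of its power.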
Figure 4 Microwave spectra of FSP, (a) all harmonics and (b) fundamental harmonic only.

When  a  modulating  signal  is  applied  to  the  laser  diode,  the  frequency  of  FSP  is  modulated  by  the 
applied  modulating  signal.  Figure  5  shows  the  spectrum  of  the  frequency  modulated  FSP  signal.  The 
carrier  is  the  4.70  GHz  FSP  and  the  modulating  signal  is  a  200  MHz  sinusoid.  Eight  sidebands  are 
observed. The uneven first sidebands signify residual amplitude modulation. This is expected because
the laser output power is also dependent on the bias current. Frequency modulation of FSP by signals of
frequencies exceeding 500 MHz has been demonstrated in the experiment. This FM signal can be
demodulated  using  standard  FM  demodulators  such  as  the  delay  line  demodulator.  We  have  achieved  a 
signal-to-noise  ratio  of  about  30  dB.  The  details  of  communication  links  and  networks  using  FSP  will  be 
presented  elsewhere. 

In conclusion, we report the observation of self-sustained pulsation and transient self-pulsation in
1300 nm laser diodes. The mechanism for SSP is generally believed to be the formation of periodic orbits
due to Hopf bifurcation in the laser diode nonlinear dynamics [6]. Transient self-pulsations have not been
reported before. The origin of TSP is still under investigation. Although TSP is similar to the relaxation
oscillation [12], the transient time of minutes for TSP is much longer than the typical relaxation oscillation
time. With optoelectronic feedback, TSP and SSP turn into stable FSP which can be used as
high-frequency tunable microwave subcarriers at 1300 nm. For WDM-SCM applications, it is important to
provide sufficient optoelectronic feedback so that the optical spectrum of the laser output remains
narrow. Even though FSP can be frequency modulated at very high frequencies (>500 MHz), the peak
frequency deviation is within 45 MHz. We are currently investigating methods for achieving larger peak
frequency deviation/modulation index for improved signal-to-noise ratios.
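
The relation between peak frequency deviation and modulation index can be made concrete with the numbers quoted in the text. The sketch below uses the standard FM definitions (modulation index as peak deviation divided by modulating frequency, and Carson's rule for approximate bandwidth). It treats 45 MHz as the peak deviation, which the text gives only as an upper bound, and the function names are hypothetical.

```python
def modulation_index(peak_deviation_hz, modulating_hz):
    """FM modulation index: peak frequency deviation over modulating frequency."""
    return peak_deviation_hz / modulating_hz


def carson_bandwidth_hz(peak_deviation_hz, modulating_hz):
    """Approximate FM signal bandwidth from Carson's rule."""
    return 2.0 * (peak_deviation_hz + modulating_hz)


peak_deviation_hz = 45e6   # upper bound on peak deviation quoted in the text
modulating_hz = 200e6      # modulating sinusoid used for the FM spectrum

mod_index = modulation_index(peak_deviation_hz, modulating_hz)
bandwidth_hz = carson_bandwidth_hz(peak_deviation_hz, modulating_hz)

print(f"modulation index: {mod_index:.3f}")
print(f"Carson bandwidth: {bandwidth_hz / 1e6:.0f} MHz")
```

With these values the modulation index is well below one, which illustrates why a larger peak deviation is sought: for a fixed modulating frequency, the index grows in direct proportion to the deviation.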
Figure 5 Microwave spectrum of a 4.7 GHz FSP frequency modulated by a 200 MHz
sinusoidal signal.
PRELIMINARY REPORT ON THE FEASIBILITY OF MACHINE
SPANISH DIALECT IDENTIFICATION AND DESCRIPTION OF LATIN
AMERICAN DIALECT DATABASE (LADD) DEVELOPMENT

Abstract

The  feasibility  of  machine  Spanish  dialect  identification  based  on  phonetic, 
lexical,  syntactic  or  prosodic  characteristics  is  investigated.  It  is  concluded  that 
prosody  is  a  promising  direction  to  pursue.  The  development  of  a  digitized  Latin 
American  Dialect  data  base  is  described. 

Beth  L  Losiewicz 


PART  I:  DIALECT  IDENTIFICATION 


A:  TAXONOMY 

One of the first problems that will arise in the development of machine dialect
identification of Spanish is the need for an adequate and agreed upon system of
dialect classification. There is "not a single element of structure . . . considered one
at a time and excluding vocabulary which would not be common to at least two
different dialect regions in Spain or in America" (Honsa, cited in Resnick, 1975).
"Only a combination of two or more structural features can be called characteristic
of one dialect" (Resnick, 1975).

Henriquez  Urena  (cited  by  Resnick,  1975)  based  his  Latin  American  (LA) 
dialect  taxonomy  on  the  indigenous  language  substrate,  dividing  LA  into  5  regions 
based on Nahuatl, Caribe, Yarahuaca, Quechua and Guarani. Resnick claims that
although  this  taxonomy  is  fairly  adequate  for  delineating  regions  of  lexical 
influences,  it  is  wholly  inadequate  as  a  phonetic  taxonomy. 

The  alternate  approach  is  to  divide  the  dialect  zones  by  a  combination  of 
phonetic  and  morpho-syntactic  features,  but  there  appears  to  be  little  agreement 
among  dialecticians  as  to  which  features  should  be  used  in  such  a  classification 
system. 

Rona  (cited  in  Resnick,  1975)  proposes  a  binary  taxonomy  based  on  phonetic 
and  morphosyntactic  features: 

(1) yeismo (phonemic leveling of /ll/ and /y/)

(2) zeismo (phonetic rendition of palatalized /ll/ and/or /y/ as the
voiced alveopalatal fricative [zh])

(3) voseo (use of vos for second person singular)

(4) which verb form is used with vos

Resnick  (1975)  has  developed  a  region/dialect  classification  system,  based  on 
binary  features,  which  is  probably  the  most  inclusive  to  date.  As  his  most  salient 
discriminator, Resnick chooses whether or not the /s/ is deleted, retained, or modified
in certain contexts. Broadly speaking, in coastal areas which had extended periods of
contact with Spain and were thus influenced by Peninsular language changes,
word-final /s/ is aspirated or deleted. Word-final /s/ is likely to be retained in more
isolated and mountainous areas of Latin America.

Resnick's second major distinction is whether the fricative /x/ (as in caja,
jota) has a pharyngeal or velar place of articulation. Northern hemisphere LA
regions with Atlantic or Gulf of Mexico coasts, coastal Peru, Ecuador and the Western
Texas-Mexican border tend to use a pharyngeal [h], sometimes indistinguishable
from an aspirated /s/. The velar /x/ is found in the rest of Mexico (except the Gulf
coast and a few inland areas), and in all of Southern LA, in addition to the non-coastal
parts of Peru and Ecuador (Resnick, 1975). (The complexity of region-to-dialect
feature  mapping  begins  to  appear!) 

The  third  major  feature  in  Resnick's  scheme  for  LA  dialect  discrimination  is 
the pronunciation of /r/. In standard Spanish /r/ is a voiced apicoalveolar trill, but
appears as a voiced apicovelar sibilant in Chile, Paraguay, the interior of Argentina,
and bordering areas of Peru and Bolivia. The same sibilant /r/, however, also
appears  in  the  geographically  distant  areas  of  highland  Ecuador,  parts  of  central 
Colombia,  widely  separated  parts  of  Costa  Rica,  Guatemala,  Mexico  City,  much  of 
Central  Mexico,  and  New  Mexico. 

Resnick  (1975)  proposes  a  dialect  classification  system  based  on  a  combination 
of  binary  features  such  as  these,  as  illustrated  below  with  two  of  his  proposed 
features: 


/s/ = [h] or ∅, /x/ = [x]:  Chile, Argentina, Uruguay, Paraguay, parts of Bolivia

/s/ retained, /x/ = [x]:  Mountain regions of Peru, Bolivia, Ecuador and parts of Mexico

/x/ = [h]:  Northern Countries

The mapping, of course, becomes geometrically more complex as we add binary
features to the classification system, but in Resnick's system this is done with only
five additional features: presence or absence of "yeismo" (phonemic leveling of
palatalized /ll/ and /y/, as in calló vs. cayó); whether /b/ becomes /v/ after /l/ (el
bebe => el vebe); whether or not word-final /n/ is velarized (corazon => corazong or
corazom); presence or absence of leveling or confusion of /l/ and /r/ (mar => mal);
and reduction or retention of unstressed vowels. This eight-feature classification
system delimits 2^8 = 256 possible feature combinations. Once a general classification has
been made on the basis of these binary distinctions, Resnick further classifies
dialects on more fine-grained distinctions (e.g., the manner of articulation of
intervocalic stops; pronunciation of /c/; of /f/; details of the nature of the /ll/-/y/
and /l/-/r/ leveling; the nature of /r/, /s/ and /x/; and the leveling and/or
devoicing of vowels).
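As a rough illustration of why eight binary features give 256 classes, the sketch below packs one dialect's feature values into an integer code and enumerates every combination. The feature names are shorthand invented here for readability; they are not Resnick's own labels, and the bit order is arbitrary.

```python
from itertools import product

# Hypothetical shorthand names for the eight binary features described above.
FEATURES = (
    "s_aspirated_or_deleted",     # /s/ = [h] or 0
    "x_velar",                    # /x/ = [x] rather than [h]
    "r_assibilated",              # sibilant /r/
    "yeismo",                     # /ll/ and /y/ leveled
    "b_to_v_after_l",             # /b/ becomes /v/ after /l/
    "final_n_velarized",          # word-final /n/ velarized
    "l_r_leveling",               # /l/ and /r/ leveled or confused
    "unstressed_vowels_reduced",  # unstressed vowels reduced
)


def feature_code(values):
    """Pack a mapping of feature name -> bool into an integer in 0..255."""
    code = 0
    for bit, name in enumerate(FEATURES):
        if values.get(name, False):
            code |= 1 << bit
    return code


# Every possible assignment of the eight features.
all_combinations = list(product((False, True), repeat=len(FEATURES)))
print(len(all_combinations))
```

A code of this kind only indexes the coarse classification; the finer distinctions mentioned above would need additional, non-binary fields.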

Even if a classification scheme such as Resnick's proves useful for machine
dialect identification, a further, and possibly insurmountable, difficulty still exists.
Even within a given dialect or dialect region, great variability exists both within and
between speakers. Social class, urbanicity, level of education, ethnic background,
and speaking style (e.g., formal vs. informal) all contribute to large amounts of
variability within any given dialect region. Further, dialectal features are more
often graduated or probabilistic than binary phenomena, both in terms of how
frequently and how strongly the feature appears in any given speech segment.
Especially since only limited information about frequency-of-occurrence and
inter-speaker dialect variability exists, this variation could seriously complicate dialect
identification efforts.

B:  SOURCES  OF  DIALECTS 

Contemporary  dialects  in  LA  derive  from  a  complex  interaction  of  a  number  of 
causes:  the  dialect  of  Peninsular  Spanish  spoken  by  the  original  immigrants  to  the 
region;  the  degree  and  length  of  contact  maintained  with  Peninsular  culture;  the 
nature  of  the  indigenous  language  of  the  area;  patterns  of  commerce  and  interaction 
with other areas; and degree of geographic and cultural isolation; in addition to any
local pressures towards unique diachronic changes. It cannot be too strongly
stated that dialect boundaries in LA seldom, if ever, follow modern-day
political boundaries. A more suitable delineator might be the identity and
geographical boundaries of pre-European indigenous groups (e.g., Nahuatl, Mayan,
Quechua, and Araucana, to name a few), since modern-day dialect boundaries tend to fall
along those of the ancient language/cultural groups, although, as mentioned, this
factor interacts with others such as duration of contact with Peninsular Spanish
and local regional topography. Highland dialects, for example, tend to differ fairly
systematically  from  coastal  dialects  (at  least  in  part  because  coastal  areas  tended  to  be 
much  more  open  to  continued  Peninsular  influence). 

C: POSSIBLE SOURCES OF IDENTIFYING ELEMENTS

Dialectical  differences  in  Spanish  can  appear  at  any  linguistic  level.  Most 
studied  are  the  phonetic  and  lexical  differences,  but  differences  have  also  been 
documented at the levels of syntax, morphology, and, less extensively, prosody. We
will look at each in turn as possible candidates for use in machine dialect
identification algorithms.

C1: PHONETIC DIFFERENCES

Ideally  what  is  needed  is  a  handful  of  phonologically  distinctive  features  that 
would successfully delineate clear geographic and/or social class variants of
Spanish. However, it is doubtful whether such a map is possible; even if one did exist, it
might be too complex and indeterminate to be useful.

To  further  aggravate  the  problem,  the  phonetic  differences,  although  clearly 
discernible  (at  least  in  their  strongest  forms)  by  the  human  ear,  may  be  too  subtle 
for  machine  discrimination.  And,  even  if  a  recognition  algorithm  could  pick  out  the 
differences,  other  serious  problems  stand  in  the  way  of  easily  using  those  phonetic 
features  as  indices  of  dialect.  First  of  all,  the  phonetic  variations  are  often 
phonologically  mediated  (i.e.,  they  occur  in  certain  phonological  contexts  and  not  in 
others) or even lexically mediated, occurring only in specific words. For example,
/r/-/l/ leveling may fail to occur, even in leveling dialects, in the case of infinitives
(hablar) and/or object pronouns (le, lo). In Chile, as a further example, aspirated
[h] appears only in the lexical item huir. Conversely, coarticulatory phenomena can
condition leveling even in dialects where leveling is not a dialectical feature, e.g.,
the coarticulatory leveling of /l/-/r/ in the word arbol. Dialect recognition based on
these features would thus require more than simple phonetic recognition. It would
require  processing  at  least  at  the  lexical  level,  making  the  recognition  process  much 
more  complex  (assuming  for  the  moment  that  even  simple  phoneme  discrimination  is 
possible  without  lexical-level  information). 

The second large problem, as mentioned above, arises from inter- and intra-speaker
variation. Any given speaker, in a long passage of speech, will exhibit
subtle allophonic variations in certain phonological contexts (e.g., in the phoneme
/s/), and the judgment of whether the variation is dialectical, coarticulatory, or
random would be a subtle one. Further, even if dialectically determined, any given
phonetic variation may or may not be obligatory. Social status and discourse register
(e.g., formal or informal speech) also affect phonetic dialectical variations. Add
this to the allophonic variations across speakers, and the difficulties may be
insurmountable.

C2:  LEXICAL  DIFFERENCES 

Lexical  differences  may  be  the  simplest  dialect  feature  in  terms  of  the  relative 
simplicity  of  the  region-to-lexicon  mapping.  Unique  regional  words 
characteristically  derive  either  from  the  indigenous  (Indian)  language  of  the 
region,  or  from  archaic  Spanish  words  introduced  through  early  Spanish  influence, 
followed  by  lengthy  periods  of  isolation  from  changing  Peninsular  Spanish.  A 
substantial  non-Hispanic  European  population  in  central  LA  (the  Rio  Plata  region) 
has contributed some lexical items as well (e.g., the ubiquitous Italian chiao,
meaning adios). Non-standard meanings for common lexical items are also common
regional  markers,  but  are  probably  too  subtle  to  be  exploited  by  any  but  the  most 
sophisticated  speech  processor,  if  at  all. 

Lexical differences do have a certain advantage as a discriminator, however.
Unlike phonetic dialectical features, which can be found in isolated geographic
pockets far removed from each other, regional lexical items are most likely to be
wholly contained within a fairly circumscribed geographic region, although the size
of the region can vary from a single city to a large region covering several
countries.

The  major  problem  with  using  dialect-unique  lexical  features  as  a  dialect 
discriminator  is  the  extent  of  the  vocabulary  that  would  be  necessary  to  support  such 
an  algorithm,  and  the  fact  that  fairly  large  samples  of  speech  would  likely  be 
necessary. A possible solution to this might be to use only high-frequency words -
greetings,  exclamations  and/or  swear  words. 

Although,  as  mentioned,  dialectical  lexical  variations  tend  not  to  appear  in 
widely  separated  geographical  regions,  region-to-item  mapping  can  nevertheless  be 
fairly  complex.  The  variation  in  regional  ranges  of  lexical  items  was  investigated 
using  a  dictionary  of  Chilean  regionalisms.  On  a  single  page,  chosen  at  random, 
were  found  words  both  of  very  limited,  and  of  very  wide  distribution:  some  words 
listed  were  common  to  all  Latin  America;  some  common  only  to  the  Rio  Plata  region; 
others  were  common  to  Chile,  Argentina,  Mexico  and  Peru;  still  others  found  only  in 
Chile  and  Mexico;  some  found  in  Chile,  Ecuador  and  Costa  Rica;  still  others  confined  to 
Chile  and  Argentina;  and  yet  others  found  in  Chile,  Colombia,  Honduras  and 
Venezuela.  While  regional/lexical  mapping  may  be  somewhat  simpler  than 
regional/phonetic-feature mapping, it is by no means entirely simple.

C3:  MORPHOLOGICAL  AND  SYNTACTIC  DIFFERENCES 

Dialects in LA also vary somewhat morphologically and syntactically. Slight
differences in verb conjugations may appear in some regions, the order of object
pronouns is idiosyncratic in some regions, and a handful of distinctive suffixes or
affixes are unique to a few circumscribed areas.¹ Among the most common
parameters of dialectical variation are the use or absence of the second person
informal plural, vosotros; the alternation between vos and tú; and the choice of verb
form used with vos. Use of the direct object pronoun lo can also vary regionally, the
indirect object pronoun le taking its place, in some contexts, in many regions. Noun
gender at a lexical level can also be idiosyncratic: e.g., el llamado in one region
might be la llamada in another. Syntactic and morphological differences, although
potentially exploitable by a trained Spanish dialectician, are probably not good
candidates on which to base machine dialect ID, since the differences are often subtle
and are probably too rare in occurrence to be used in identification of short segments
of speech. Further, radio communications commonly exhibit reduced syntactic
patterns, making it even more unlikely that dialect-distinguishing syntactic cues
would appear in radio transmissions.

C4:  PROSODIC  DIFFERENCES 


¹ For example, the use of the affix rete in Mexico, the frequency of the augmentative affix -azo
(amigazo, buenazo) in Argentina, the Quechua possessive -y in Peru, or the
characteristic -tico (momentico, ahoritica) of the Caribbean, etc.
"Intonation is perhaps the phonological feature most generally used by Latin
Americans in characterizing and imitating the speech of other Latin American
regions"  (Resnick  1975;  a  similar  comment  is  made  by  Navarro  Tomas  in  his 
introduction  to  Canfield,  1962). 

The  existing  literature  on  Spanish  prosody  does  not  shed  much  light  on  the 
issues  of  machine  dialect  identification.  First  of  all,  studies  on  Spanish  prosody  are 
sparse, and those addressing differences between dialects are even sparser. Most of
the studies (many of which pre-date the spectrogram) rely on impressionistic
judgments² rather than acoustic analysis of the data, and almost all focus on prosody
as  conditioned  by  linguistic  and  discourse  factors:  differences  between  declarative 
and  interrogative  sentences;  rising  or  falling  intonation  as  a  marker  of  finality  or 
continuation,  etc.  Further,  most  of  the  studies  were  conducted  with  quite  small 
speaker  samples  (e.g.,  1-4  speakers  of  each  dialect).  Kvavik  and  Olsen  (1974)  review 
the  intonational  literature.  Those  studies  which  contrasted  dialects  are  summarized 
below. 

C4.1  Mexican  Spanish 

Matluck (1965) found that Mexican Spanish is characterized by higher
frequency on the accented syllable (than in "standard" Spanish), and by level,
instead of falling, phrase-final intonation contours. Kvavik (1974) found that,
contrary to the traditional claim that Mexican Spanish has a greater frequency
range than Castilian Spanish, the frequency ranges of the two are quite similar. The
largest difference she found was in the relationship between the average frequency
of a phrase and the frequency of the phrase-final segment. The phrase-final
frequency of Mexican Spanish tends to be an average of 4 Hz above the phrase
frequency average, whereas in Castilian Spanish it tends to be 4 Hz below the
phrase  average.  She  suggests,  with  Matluck  (1965),  that  even  when  phrase-final 
intonation  remains  level,  the  listener's  expectation  (from  standard  Spanish)  of 
hearing  a  falling  tone  leads  to  the  perception  of  rising  intonation. 
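The measure Kvavik reports, phrase-final frequency relative to the phrase average, is easy to state computationally. The sketch below is an illustration only: it assumes a fundamental-frequency track with one value per analysis frame is already available, and it treats the last 20 percent of voiced frames as "phrase-final", a definition chosen here for convenience and not taken from Kvavik (1974). The contours are made-up numbers, not measured data.

```python
from statistics import mean


def phrase_final_offset(f0_hz, final_fraction=0.2):
    """Mean F0 of the phrase-final stretch minus mean F0 of the whole phrase (Hz).

    f0_hz holds one F0 value per frame; values <= 0 mark unvoiced frames and
    are ignored. final_fraction is an assumed definition of "phrase-final".
    """
    voiced = [f for f in f0_hz if f > 0]
    if not voiced:
        raise ValueError("no voiced frames in phrase")
    n_final = max(1, round(len(voiced) * final_fraction))
    return mean(voiced[-n_final:]) - mean(voiced)


# Synthetic contours in Hz (illustrative values only).
high_ending = [120, 125, 130, 125, 120, 118, 118, 118, 122, 122]
falling_ending = [120, 125, 130, 125, 120, 118, 118, 118, 112, 110]

print(phrase_final_offset(high_ending))     # positive: ends above the phrase mean
print(phrase_final_offset(falling_ending))  # negative: ends below the phrase mean
```

A positive offset corresponds to the pattern reported for Mexican Spanish and a negative one to the pattern reported for Castilian Spanish; whether a difference of a few hertz is reliably measurable on field recordings is a separate question.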

C4.2  Argentinian  Spanish 


² Lieberman (1965) found that impressionistic judgments about prosodic
characteristics such as pitch level and terminal juncture type, even when done by
trained linguists, are highly variable (between linguists), and thus relatively
unreliable.
Malmberg (1950)³ noted that the Argentinian accent is characterized by a
strong  descending  intonation;  and  both  Malmberg  (1948)  and  Alonso  and  Henriquez 
Urena  (1939)  noted  Italian  influences  in  the  prosody  of  Buenos  Aires  Spanish. 

Alvarez  Puebla  de  Chavez  (1948)  noted  intonational  differences  between 
Buenos  Aires  and  Cordoba  Argentinian  Spanish,  and  Vidal  de  Battini  (1949;  1954) 
delimited  six  different  intonational  regions  in  Argentina,  and  also  mentioned 
intonational  differences  peculiar  to  certain  social  strata.  Fontanella  (1966)  found  a 
series of prosodic differences between the dialects of Tucuman and of Buenos Aires,
Argentina: 

- a difference in the number of contrastive pitch levels in the final stressed syllable

- Tucuman has a rising glide absent in Buenos Aires

- a difference in length and pitch of unstressed syllables in comparison with
  stressed syllables

- Buenos Aires has a falling phrase-final intonation option absent in Tucuman

- declarative sentences in Tucuman often have a high rise on final unstressed
  syllables (often mistaken by Buenos Aires speakers for interrogative
  intonation)

A further study by Fontanella (1971) describes the dialect of Cordoba, which is
characterized by an additional contrastive pitch level, and by a lengthening and pitch
change in the last pretonic, tonic, or internal pretonic syllable.

C4.3  Chilean  Spanish 

Malmberg  (1948)  suggested  that  indigenous  languages  had  an  effect  on 
Chilean  Andean  intonation.  Gallardo  (1953)  described  a  regional  intonation  on  a 
Chilean  island.  Oroz  (1966)  maintained  that  the  prosody  of  Chilean  Spanish  is  quite 
different from that of standard Spanish, and briefly described regional intonational
differences  which  he  ascribed  to  the  influence  of  indigenous  languages.  He  also 
found  prosody  to  be  alike  in  the  two  Chilean  cities  of  Valparaiso  and  Santiago,  and  in 
neither  were  there  prosodic  differences  based  on  social  class. 

C4.4  Conclusions 


³ All references in the paragraphs on Argentinian and on Chilean Spanish are cited
in Kvavik & Olsen (1974). The reader is referred to that source for the full
bibliographic references (i.e., they will not be duplicated in the bibliography of the
present paper).
Although a handful of studies have been reported, the literature on prosodic
differences between different dialects of Spanish is so sparse that machine dialect
identification based on prosodic cues identified by linguists is probably not feasible
at this point without a large-scale study to fill in the large gaps in our knowledge.
However, it is possible that more general, or global, prosodic differences could be
identified and directly investigated with current machine clustering algorithms
such as those currently used for speaker or language identification. Since,
anecdotally, native speakers of Spanish are fairly adept at identifying dialect based
on prosodic cues, there is reason to believe that global prosodic characteristics may
be adequate to discriminate between dialects.⁴

D:  GENERAL  CONCLUSIONS 

Success  of  a  dialect  recognition  algorithm  is  by  no  means  certain.  The 
phonological differences (e.g., /l/-/r/ confusions; aspiration or deletion of /s/;
yeismo;  the  presence  or  absence  of  the  phoneme  [zh];  assibilation  of  /r/)  may  well  be 
too  subtle  to  be  utilized  directly  in  a  phoneme-based  dialect  recognition  algorithm. 
Lexical,  morphological  and  syntactic  differences  may  well  require  more 
sophisticated  processing  than  is  currently  available,  or  at  least  than  is  feasible  in  a 
"preliminary  sorting  by  dialect"  application,  since  large  speech  samples  and 
complete  language  analysis  at  many  linguistic  levels  would  be  necessary  to 
disentangle  dialects  based  on  this  information.  Anecdotal  evidence  indicates  that 
dialects  may  be  prosodically  distinguishable,  but  virtually  no  dialectical  analysis 
has  been  done  along  these  lines.  Different  dialects  relatively  systematically 
manipulate  certain  classes  of  phonemes  (stops,  liquids,  aspirants),  which  could  well 
leave  a  trace  in  abstract  acoustic  representations  of  the  speech,  such  as 
cepstra, making general acoustic analysis a possible route for analysis. Certain
dialects (e.g., in the Quechua-influenced regions where vowel-leveling is common)
may  well  be  differentiable  based  on  formant  characteristics,  although  the 
question  remains  open  whether  these  vowel-reducing  dialects  would  even  be 
identified  as  "Spanish"  by  formant-based  language  identification  paradigms  in  the 
first  place. 


⁴ Lieberman (1965), in fact, claims that even linguists do not use discrete cues to
identify  prosodic  features  like  pitch  level  and  terminal  juncture  type,  but  rely  on 
more  global  characteristics  such  as  amplitude  and  fundamental  frequency  changes 
over  time. 
Possible further directions for research include:

- Extract pitch contours of different dialects to investigate whether they
  have distinctive contours or ranges.

- Use dialect samples from LADD to investigate whether current speaker
  identification or language identification algorithms can be used for
  dialect identification.

- Use dialect samples from LADD to test the ability of current language
  identification algorithms to succeed in dialect recognition.

- Use dialect samples from LADD to investigate whether trained dialecticians
  can discriminate between dialects, based either on the full sample or on
  extracted prosodic contours.
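The first direction above, pitch contour extraction, can be sketched with a textbook autocorrelation estimator. This is a minimal illustration under stated assumptions, not the method proposed in this report: the frame length, hop size, and the 60 to 400 Hz search range are values chosen here, there is no voicing decision beyond rejecting frames with no positive autocorrelation peak, and the test signal is a synthetic tone and not LADD data.

```python
import math


def estimate_f0(frame, sample_rate, f0_min=60.0, f0_max=400.0):
    """Estimate F0 (Hz) of one frame from its autocorrelation peak.

    Returns None if the frame is too short or has no positive peak.
    """
    lag_min = int(sample_rate / f0_max)
    lag_max = int(sample_rate / f0_min)
    if len(frame) <= lag_max:
        return None
    mean_val = sum(frame) / len(frame)
    x = [s - mean_val for s in frame]  # remove any DC offset
    best_lag, best_val = None, 0.0
    for lag in range(lag_min, lag_max + 1):
        r = sum(x[i] * x[i + lag] for i in range(len(x) - lag))
        if r > best_val:
            best_lag, best_val = lag, r
    return sample_rate / best_lag if best_lag else None


def pitch_contour(samples, sample_rate, frame_len=400, hop=160):
    """Return one F0 estimate per frame over the whole signal."""
    return [
        estimate_f0(samples[start:start + frame_len], sample_rate)
        for start in range(0, len(samples) - frame_len + 1, hop)
    ]


# Synthetic check: a steady 200 Hz tone sampled at 8 kHz.
rate = 8000
tone = [math.sin(2 * math.pi * 200 * n / rate) for n in range(1600)]
contour = pitch_contour(tone, rate)
print(contour)
```

Contours obtained this way could then be compared across dialects by range, mean, or phrase-final behavior. Real speech would need a more robust estimator and an explicit voiced/unvoiced decision.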


Part II: LADD (LATIN AMERICAN DIALECT)
DATA BASE DEVELOPMENT

A:  INTRODUCTION  AND  DESCRIPTION  OF  LADD 

No  adequate  Spanish  database,  labeled  by  dialect,  currently  exists.  To  remedy 
this,  development  of  such  a  database  was  undertaken,  using  speech  samples  collected 
from several Spanish dialecticians in the United States (see "Sources" below). Their
original ("source") recordings were made using a potpourri of analog audio
recording equipment for which, for the most part, information is no longer
available. However, since some educated guesses can be made about
recording conditions based on other evidence, whatever information was available
(and  this  varied  somewhat  from  sample  to  sample)  was  carefully  documented:  date  of 
source  recording;  name  of  person  making  recording;  name  of  dialectician  in  whose 
collection  it  was  found;  etc. 

The problems arising from this variation in recording equipment are not
insurmountable, but they do demand that care be taken in selecting samples for tests to
ensure, for example, that proposed dialect ID algorithms are not in reality merely
identifying the equipment used for the original recording. Alternatively, samples
could be filtered before use to level the variations resulting from the variation in the
original recording equipment.

The  advantages  of  the  data  base,  however,  far  outweigh  the  disadvantage  of 
this  variation  in  original  recording  equipment.  First,  this  format  allows  the 
exploitation  of  large  amounts  of  data  that  have  already  been  collected  by  dialecticians 
across  the  U.S.  Second,  in  a  very  real  sense,  it  mimics  the  real-world  data  that  dialect 
ID algorithms will be expected to cope with, in a way that a collection of studio-quality
recordings cannot. Third, and probably most importantly, sample biases
inherent in other databases are minimized: speakers, for the most part, have been
solicited rather than self-selected (as in the OGI database); speakers range in
education level from illiterates to holders of PhDs, and in occupation from army
generals, presidents and engineers to sugar cane farmers and housewives;
recordings  have  been  made  both  in  the  U.S.  and  in  Latin  America,  in  both  field  and 
studio  conditions;  speech  styles  range  from  formal  read  speech  to  lectures,  speeches, 
harangues,  gossip,  stories,  monologues,  interviews,  and  linguistic  surveys.  This 
information,  whenever  known,  is  carefully  noted  in  the  notes  to  the  database.  While 
these  characteristics  are  not  conducive  to  a  thought-free,  instant  "add  algorithm  and 
run"  approach,  the  database  is  exceedingly  versatile,  and  with  judicious  use  should 
have  almost  unlimited  potential  applications. 

Speech  samples  were  carefully  screened  for  reliability  of  information  about 
the  speaker:  samples  were  only  collected  for  those  speakers  unambiguously 
documented  as  to  country  (or  even  city)  of  origin,  and  sex  of  speaker. 

B:  PROCEDURES 

Audio  recordings  were  borrowed  from  Spanish  dialecticians  (see  "Sources"), 
and  transferred  to  DAT  tape  using  a  Sony  670  High-Density  linear  A/D  D/A 
Converter.  Source  tapes  were  played  on  either  a  Sony  Tapecorder  TC755  Closed  Loop 
Dual  Capstan  or  a  Sony  Tapecorder  TC645  3-Head  reel-to-reel  deck.  If  the  source  tape 
was  a  cassette,  it  was  played  on  a  Tascam  Model  122  cassette  recorder  with  Dolby. 

From  each  2-hour  DAT  tape,  speech  segments  are  transferred  to  computer  hard 
disk,  using  Turtle  Beach  56K  SoundStage  DSP  hardware  and  software  in  sound  files  of 
30  minutes  or  less  duration.  The  speech  files  are  currently  being  segmented  as 
necessary  to  ensure  that  only  the  desired  speaker  is  contained  in  the  sound  file.  In 
some instances (e.g., interviews and conversations) this requires extensive cutting so
that speech from only one speaker is present in any one file. In other instances
(e.g., reading, monologues and formal speeches) such segmentation is not necessary,
and is only performed if there is a break in the audio recording, or a difference in
speech style between one segment and another (e.g., the first part is a speech and the
second part, for the same speaker, an interview). Cutting occurs at intonation
boundaries whenever possible, and always respects word boundaries.

Each  segment  is  then  labeled  with  a  unique  identifier  providing  information 
about  the  dialect  and  the  speaker.  Data  about  each  speaker,  region  of  origin,  speech 
style,  quality  of  the  sound  file,  the  original  audio  source,  etc.  are  maintained  in  a 
separate  permanent  data  base  (ORACLE)  at  Rome  Labs  IRAA/Speech  Processing. 

In the final phase, digital binary sound files are archived both on magneto-optical
disk and on magnetic tape for exploitation by government and contractor
agencies. 

The data base on audio or on DAT tape, along with documentation, is available at
Rome Labs, IRAA, Speech Processing, Griffiss AFB, New York 13441, phone (315) 330-4024.
Abstract

In this report a straightforward approach for automatic extraction of drainage networks
from Digital Terrain Elevation Data (DTED) of the Defense Mapping Agency (DMA) is presented.
The approach is based on common image processing methods such as frequency domain filtering,
spatial domain histogram equalization, binarization, thinning and other techniques. The approach
has been examined with several DTED files. The results obtained from this approach are as good
as, or better than, those of other methods. The approach was developed using the Khoros image
processing system and C programming. Khoros has an excellent visually based environment, and its
rather extensive library of functions helped avoid writing commonly used code.
AUTOMATIC EXTRACTION OF DRAINAGE NETWORK FROM
DIGITAL  TERRAIN  ELEVATION  DATA 

Mohamad  T.  Musavi  and  Andrew  J.  Laffely 


I.  INTRODUCTION 

A.  Techniques 

Digital  elevation  data  files  are  important  sources  of  information  that  have  been  used  in  many 
remote  sensing  applications  such  as  geometric  rectification  [Wiesel,  85],  terrain  related  radiometric 
correction  [Colby,  91],  image  registration  [Dubayah  &  Dozier,  86],  topographic  map  update 
[Gugan  &  Dowman,  88],  and  extraction  of  such  important  features  as  drainage  networks 
[Argialas,  Rouge,  &  Mintzer,  88;  Jenson  &  Domingue,  88;  Hardipriono,  Lyon,  Li,  &  Argialas, 
90].

Drainage networks have been applied in water resource assessment, hydrogeometric
analysis, interpretation of soils and rocks, and segmentation of textures and structures. In
principle the task is straightforward. Water drains from high to low elevation along channels that
are connected to one another according to well-defined rules [Parvis, 50; Howard, 67; Argialas,
85]. The majority of the techniques used for extraction of water drainage are based on low-level
pixel (cell) labeling by using local operators and subsequent pixel grouping according to local
criteria [O'Callaghan & Mark, 84; Jenson, 84]. Recently, Qian, Ehrich, & Campbell [90] applied both
local operators and global reasoning to improve on previous results. Water
drainage extraction techniques normally assume one-pixel-wide networks, an assumption that may
cause difficulty in flat areas with flood patterns [Lee, Snyder, & Fisher, 92].

The  technique  presented  here  is  based  on  two  fundamental  steps.  In  the  first  step,  a 
complete  and  connected  candidate  for  a  possible  drainage  network  is  found.  In  the  second  step, 
the  possible  drainage  network  is  pruned  to  remove  unnecessary  links  that  are  not  part  of  the 
drainage  network.  The  first  step  is  based  on  a  set  of  image  processing  routines  such  as  data 
preprocessing,  frequency  domain  filtering,  spatial  domain  histogram  equalization,  binarization, 
thinning and image blending. The Khoros image processing environment [The University of New
Mexico, 1990] has been used in this study because its rather extensive library of functions helped
avoid writing commonly used code.
B. Databases


There  are  two  common  sources  of  elevation  data,  the  Defense  Mapping  Agency  (DMA) 
Digital  Terrain  Elevation  Data  (DTED)  and  the  United  States  Geological  Survey  (USGS)  Digital 
Elevation Model (DEM) data. While the experiments of the previous studies were all conducted on
the DEM data, those of this study used the DTED database. This database consists of many data files
of digital terrain elevation data. Each data file is a cell defined by a matrix with longitude and
latitude coordinates (x, y) and elevations (z). A cell covers a 1 degree by 1 degree geographical
reference area. The sampling interval for elevation data is 3 arc-seconds by 3 arc-seconds. Absolute
horizontal and vertical accuracies of DTED data are 50 and 30 meters respectively, while relative
accuracies within a cell are 30 and 20 meters [Military Specification, 86]. DTED data is available
on CDROM and 9-track 1/2" magnetic tape.
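The cell geometry above implies the matrix size and approximate ground spacing of the data. The short sketch below works through that arithmetic. It assumes a spherical-earth approximation in which one arc-minute of latitude is about one nautical mile (1852 m), and it assumes both cell edges are included in the matrix; the function names are illustrative only.

```python
import math

ARCSEC_PER_DEGREE = 3600
# Assumption: 1 arc-minute of latitude is roughly 1 nautical mile (1852 m).
METERS_PER_ARCSEC_LAT = 1852.0 / 60.0


def posts_per_side(cell_degrees=1, spacing_arcsec=3):
    """Number of elevation posts along one side of a cell, edges included."""
    return cell_degrees * ARCSEC_PER_DEGREE // spacing_arcsec + 1


def ground_spacing_m(latitude_deg, spacing_arcsec=3):
    """Approximate (north-south, east-west) post spacing in meters."""
    north_south = spacing_arcsec * METERS_PER_ARCSEC_LAT
    east_west = north_south * math.cos(math.radians(latitude_deg))
    return north_south, east_west


print(posts_per_side())        # posts per side of a 1 x 1 degree cell
print(ground_spacing_m(45.0))  # spacing in meters at 45 degrees latitude
```

Under these assumptions a cell holds 1201 posts per side, with a north-south spacing of roughly 93 m and an east-west spacing that shrinks with the cosine of the latitude.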

II.  METHODOLOGY 

The following sections present a step-by-step description of the proposed methodology for
automatic extraction of drainage networks. A block diagram of the operations is shown in Figure 1.
In  order  to  show  the  result  of  each  operation,  a  sample  DMA  data  file  has  been  selected.  The  file  is 
a  matrix  of  128x128  elevation  data.  A  three  dimensional  plot  of  this  elevation  file  is  shown  in 
Figure  2.  The  elevation  in  this  area  changes  from  420  to  4160  meters  as  indicated  in  the  plot.  For 
the  sake  of  visual  representation  of  the  data  and  the  future  results,  the  elevation  data  is  converted  to 
gray  level  representations  from  0,  for  the  lowest  elevation,  to  255  for  the  highest  elevation.  The 
gray  level  representation  of  elevation  data  of  Figure  2  is  shown  in  Figure  3.  Note  that  all  future 
operations  are  performed  on  the  original  elevation  data  and  only  for  demonstration  purposes  the 
results  are  converted  to  gray  level  images.  In  Figure  3  the  lower  elevations  are  almost  featureless 
because  of  the  great  contrast  of  elevation  present  in  this  data. 
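The gray level conversion described above can be sketched as a linear rescaling. This is an assumed implementation for illustration (the report does not give the exact mapping or rounding rule), and the small matrix is made-up data spanning the 420 to 4160 meter range quoted for the sample file.

```python
def to_gray_levels(elevations):
    """Linearly map elevations to integers 0..255 (lowest -> 0, highest -> 255)."""
    flat = [z for row in elevations for z in row]
    lo, hi = min(flat), max(flat)
    if hi == lo:
        # A perfectly flat area has no contrast to display.
        return [[0 for _ in row] for row in elevations]
    return [
        [int(round((z - lo) * 255.0 / (hi - lo))) for z in row]
        for row in elevations
    ]


# Illustrative 2 x 3 elevation matrix in meters (not taken from a DTED file).
sample = [[420, 450, 4160],
          [435, 2290, 420]]
print(to_gray_levels(sample))
```

With this range each gray level spans about (4160 - 420) / 255, or roughly 14.7 meters, so relief of a few tens of meters in the lowlands occupies only two or three gray levels. That is consistent with the lower elevations appearing almost featureless in Figure 3.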

A. Data Conversion

A DTED file supplied on CDROM is a 1201x1201 matrix of elevation data in the DMA
data format. This data is first converted to the format accepted by the Khoros image processing
software and is then split into smaller matrices.
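The splitting step can be sketched as simple tiling. The report does not say how the 1201x1201 matrix was divided, so the choices here are assumptions: the tile size of 128 is borrowed from the sample file used in this section, tiles do not overlap, and partial tiles at the right and bottom edges are dropped.

```python
def split_into_tiles(matrix, tile=128):
    """Yield (row0, col0, block) for each full tile x tile block.

    Assumption: non-overlapping tiles; ragged edge strips are discarded.
    """
    n_rows, n_cols = len(matrix), len(matrix[0])
    for r in range(0, n_rows - tile + 1, tile):
        for c in range(0, n_cols - tile + 1, tile):
            yield r, c, [row[c:c + tile] for row in matrix[r:r + tile]]


# Placeholder 1201 x 1201 matrix standing in for a converted DTED cell.
cell = [[0] * 1201 for _ in range(1201)]
tiles = list(split_into_tiles(cell))
print(len(tiles))
```

Under these assumptions a cell yields a 9 by 9 grid of full tiles, leaving a strip of 49 rows and 49 columns unused. Overlapping tiles would avoid cutting drainage channels at tile borders, at the cost of redundant processing.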
Figure 1. A block diagram for the automatic extraction of drainage networks
(input: DTED file; output: drainage network).
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B.  High  Pass  Filter 


The first step of the data processing is to bring out important features, such as peaks and
valleys, that are necessary for identifying water drainage. A common tool for this is a
high pass filter (HPF). The filter can be applied in either the spatial domain or the frequency
domain; however, the frequency domain operation is more accurate and provides more flexibility
for changing the cut-off frequency of the HPF. The cut-off frequency proves to be an important
factor in handling different geographical areas. For example, in an area with large elevation
changes the cut-off frequency can be quite low, while in an area with small elevation changes the
cut-off frequency has to be higher in order to identify valleys with smooth changes that might be
part of a drainage network or flood pattern.

To apply an HPF in the frequency domain, the data is first passed to a Fast Fourier
Transform (FFT) program, which converts the spatial domain data to the frequency domain. The
transformed data is then multiplied by the transfer function of the HPF and passed to an inverse
FFT to recover the processed spatial data. Figure 4 shows the result of a high pass filter on the
test data. The HPF was a first order exponential filter with a normalized cut-off frequency of 0.1.
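
The transform, multiply, and inverse-transform sequence can be sketched as follows. This is a minimal illustration, not the Khoros implementation. It assumes the exponential high pass transfer function H = exp(-(D0/D)^n), that the normalized frequency D is measured in cycles per sample, and it uses a plain discrete Fourier transform instead of an FFT so that the example needs only the standard library. All function names are hypothetical.

```python
import cmath
import math


def dft_1d(values, inverse=False):
    """Plain O(n^2) discrete Fourier transform of a sequence."""
    n = len(values)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t, v in enumerate(values):
            acc += v * cmath.exp(sign * 2j * math.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out


def dft_2d(grid, inverse=False):
    """Separable 2-D transform: rows first, then columns."""
    rows = [dft_1d(row, inverse) for row in grid]
    columns = [dft_1d(col, inverse) for col in zip(*rows)]
    return [list(r) for r in zip(*columns)]


def exponential_highpass(u, v, rows, cols, cutoff, order=1):
    """Assumed transfer function H = exp(-(cutoff / D)^order), with H = 0 at DC."""
    fu = min(u, rows - u) / rows  # fold frequency index into 0..0.5 cycles/sample
    fv = min(v, cols - v) / cols
    distance = math.hypot(fu, fv)
    if distance == 0:
        return 0.0
    return math.exp(-((cutoff / distance) ** order))


def highpass_filter(grid, cutoff=0.1, order=1):
    """Forward transform, multiply by the transfer function, inverse transform."""
    rows, cols = len(grid), len(grid[0])
    spectrum = dft_2d(grid)
    filtered = [[spectrum[u][v] * exponential_highpass(u, v, rows, cols, cutoff, order)
                 for v in range(cols)] for u in range(rows)]
    return [[z.real for z in row] for row in dft_2d(filtered, inverse=True)]


flat = [[100.0] * 8 for _ in range(8)]
flat_out = highpass_filter(flat)  # a constant surface has no high frequency content
```

Raising `cutoff` suppresses more of the slowly varying terrain, which is the adjustment the text describes for different geographical areas.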

C. Histogram Equalization

Although a possible drainage network might be recognizable by the human eye in Figure 4,
the data still poses difficulties for automatic extraction by computer. To further enhance the
possible drainage network, the histogram of the data is equalized using a histogram equalization
program. The result is shown in Figure 5. A comparison of Figures 5 and 3 shows the significance
of the histogram equalization operation.
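
Histogram equalization remaps gray levels through the cumulative histogram so that the output levels are spread as evenly as possible. The sketch below is one common textbook form of the operation, assuming integer gray levels in 0..255; it is not the specific program used in the report, and the function name is hypothetical.

```python
def equalize_histogram(image, levels=256):
    """Remap integer gray levels through the normalized cumulative histogram."""
    pixels = [p for row in image for p in row]
    histogram = [0] * levels
    for p in pixels:
        histogram[p] += 1

    # Cumulative distribution of gray levels.
    cdf = []
    running = 0
    for count in histogram:
        running += count
        cdf.append(running)

    cdf_min = next(c for c in cdf if c > 0)
    total = len(pixels)
    if total == cdf_min:
        # Only one gray level is present; nothing to equalize.
        return [list(row) for row in image]

    lookup = [max(0, round((c - cdf_min) * (levels - 1) / (total - cdf_min)))
              for c in cdf]
    return [[lookup[p] for p in row] for row in image]


# Four closely spaced levels are stretched across the full 0..255 range.
stretched = equalize_histogram([[10, 20], [30, 40]])
```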

D.  Low  Pass  Filter 

Along with the high frequency and histogram information, the noise in the data is also
enhanced by the preceding steps. To suppress the noise, a low pass filter (LPF) is used. The result
of applying the LPF to the equalized data is shown in Figure 6. The filter used in this example was
a second order Butterworth filter with a normalized cut-off frequency of 0.2. Combined with the
earlier HPF, this provides an overall band-pass effect on the input data. Note that the LPF is
implemented in the same way as the HPF explained in section B above.
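
The band-pass effect can be illustrated by multiplying the two transfer functions. The sketch below uses the standard Butterworth low pass form H = 1 / (1 + (D/D0)^(2n)) and, as an assumption, the exponential high pass form H = exp(-(D0/D)^n). The product is only a rough picture, since the histogram equalization applied between the two filters is non-linear.

```python
import math


def butterworth_lowpass(distance, cutoff=0.2, order=2):
    """Standard Butterworth low pass gain at normalized frequency `distance`."""
    return 1.0 / (1.0 + (distance / cutoff) ** (2 * order))


def exponential_highpass(distance, cutoff=0.1, order=1):
    """Assumed exponential high pass gain; zero at DC."""
    if distance == 0:
        return 0.0
    return math.exp(-((cutoff / distance) ** order))


def combined_gain(distance):
    """Product of the two gains, ignoring the equalization step in between."""
    return exponential_highpass(distance) * butterworth_lowpass(distance)


# Print the approximate band-pass response at a few normalized frequencies.
for d in (0.0, 0.05, 0.1, 0.2, 0.4):
    print(f"{d:.2f}  {combined_gain(d):.3f}")
```

The gain is zero at DC, rises through the region between the two cut-off frequencies, and falls off again above 0.2.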
E. Binarization

The elevation data can now be segmented to isolate the possible drainage area by a
binarization process. Using a fixed threshold for the entire data file will cause either loss of
drainage data or introduction of excessive points, depending on the value of the threshold.
Therefore, a variable threshold binarization [Musavi, Shirvaikar, Ramanathan, & Nekovei, 88]
was used. Several sub-steps are taken to accomplish this binarization. The
threshold for an elevation data point n is defined as

$$T_n = K_n \cdot \mathrm{STDDEV}_n + \mathrm{MEAN}_n, \qquad n = 1, 2, \ldots, M \tag{1}$$

where M is the total number of data points in the file. $K_n$, $\mathrm{STDDEV}_n$ (standard
deviation), and $\mathrm{MEAN}_n$ (mean) are defined as

$$K_n = \frac{a \cdot \mathrm{HIST}(n)}{M} \tag{2}$$

$$\mathrm{MEAN}_n = \sum_{i} \frac{P_i}{(2d+1)^2}, \qquad i = 0, 1, 2, \ldots, (2d+1)^2 - 1 \tag{3}$$

$$\mathrm{STDDEV}_n = \left[ \sum_{i} \frac{(P_i - \mathrm{MEAN}_n)^2}{(2d+1)^2} \right]^{1/2} \tag{4}$$

where a is a constant (3 in our experiments), HIST(n) is the histogram element corresponding to
the n-th data point, $P_i$ is the value of the i-th data point in a local area around the n-th
point, and d determines the size of the local area. For example, d=1 for a 3x3 area, d=2 for a 5x5
area, and so on. The result of the variable threshold binarization is shown in Figure 7. One option
for drainage network extraction was to apply a thinning algorithm at this stage, the result of
which was a fairly well connected set of paths. However, the thinning algorithm output might not
accurately provide the drainage paths for all areas. This is especially true where a given path is
bordered on one side by sharp elevation changes and on the other by more gradual ones.
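
The threshold of equations (1) through (4) can be sketched as follows. Several details are assumptions, since the text does not state them: the input is taken to hold integer gray levels, the local window is clipped at the image border (so the divisor is the number of pixels actually in the window), and a point is marked 1 when its value lies below its threshold, on the reasoning that drainage follows low terrain. The function name is hypothetical.

```python
import math


def variable_threshold_binarize(image, d=1, a=3.0, levels=256):
    """Binarize with a per-pixel threshold T = K * stddev + mean (equations 1 to 4)."""
    rows, cols = len(image), len(image[0])
    total = rows * cols  # M in equation (1)

    histogram = [0] * levels
    for row in image:
        for p in row:
            histogram[p] += 1

    binary = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            # Local (2d+1) x (2d+1) window, clipped at the border (assumption).
            window = [image[rr][cc]
                      for rr in range(max(0, r - d), min(rows, r + d + 1))
                      for cc in range(max(0, c - d), min(cols, c + d + 1))]
            mean = sum(window) / len(window)
            stddev = math.sqrt(sum((p - mean) ** 2 for p in window) / len(window))
            k = a * histogram[image[r][c]] / total  # equation (2)
            threshold = k * stddev + mean           # equation (1)
            # Assumed polarity: values below the local threshold are drainage candidates.
            binary[r][c] = 1 if image[r][c] < threshold else 0
    return binary


# A single low column (value 1) running through otherwise flat terrain (value 5).
valley = [[5, 5, 1, 5, 5] for _ in range(5)]
area = variable_threshold_binarize(valley, d=1)
```

On this toy input the result marks the valley column together with its immediate neighbors, which matches the description of the binarization output as a general drainage area and not a thin path.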

F.  Local  Minima 

The above binarization process provides a general area of the possible drainage network.
The local minima in this general area provide a more accurate representation of the drainage
network. One should note that the local minima alone are not sufficient to provide a connected
drainage network because of their local reach. The significance of the generalized drainage area
is in its global nature, global in the sense that it provides a connected path for the network.
Therefore, a merger between the general area and the local minima will provide a much better data
set for extraction of the actual drainage network.

A simple routine is used to find the local minima in the x, y, and both diagonal
directions from the original data. Each data point in the original elevation file is compared to
the four that precede it and the four that follow it (9 points in each local array). If the
elevation of the test point is lower than or equal to all the points of the local array, the
corresponding point in a binary representation file is set to 1. If any one of the points in the
local array is lower than the test point, a 0 is recorded for the test point. This process is
repeated for all rows, columns and diagonals of the data file. As a result, any point in the output
file can be set based on only one testing direction. This enables the extraction of drainage paths
in the four connected neighbors of the test pixel. Figure 8 shows a binary file indicating the
local minima of the original elevation data. Notice in Figure 8 that some points where the drainage
paths seem to converge are not set. This is a direct result of the local minima algorithm and part
of the reason that it alone is unable to extract a connected drainage path.
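
The directional test can be sketched as below. The text does not say how points near the border are handled; this sketch assumes that a direction is tested only where the full local array fits inside the data. The `reach` parameter (4 in the text, giving 9-point arrays) and the function name are introduced for illustration.

```python
# Row, column, and the two diagonal directions.
DIRECTIONS = ((0, 1), (1, 0), (1, 1), (1, -1))


def local_minima(elevation, reach=4):
    """Mark a point 1 if it is a minimum of its local array in any one direction."""
    rows, cols = len(elevation), len(elevation[0])
    offsets = [k for k in range(-reach, reach + 1) if k != 0]
    minima = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            for dr, dc in DIRECTIONS:
                neighbors = [(r + k * dr, c + k * dc) for k in offsets]
                # Assumption: skip directions whose local array leaves the grid.
                if not all(0 <= nr < rows and 0 <= nc < cols for nr, nc in neighbors):
                    continue
                if all(elevation[r][c] <= elevation[nr][nc] for nr, nc in neighbors):
                    minima[r][c] = 1  # one direction is enough to set the point
                    break
    return minima


# A valley along the middle column of a surface that slopes gently downward
# toward the first row.
surface = [[10 * abs(c - 2) + r for c in range(5)] for r in range(5)]
marked = local_minima(surface, reach=1)
```

Because the comparison is "lower than or equal", every point of a perfectly flat stretch is marked, so the output is best treated as a candidate set to be merged with the general drainage area.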

G. Local Blending and Global Thresholding

After the two binary files are created (the local minima and the variable threshold
binarization outputs), they need to be combined in a manner that enhances the connectivity and
reduces noise. First, a weighted sum of the two files is used to combine the information into a
single image. The optimization of the weighting factor has not been extensively studied, but adding
one part variable threshold information and two parts local minima data has given consistent
results. In other words, the summation creates a 4 level (0, 1, 2, and 3) data file, as shown in
Figure 9. Note that the local minima help to identify the accurate location of the
drainage network within the general area of drainage.

Considering only the local minimum points of Figure 8, one can identify a possible
drainage network that is not completely connected. However, the information in the general
drainage area can be used to create the appropriate connectivity. One way to do this is to apply a
local blending or smoothing operator to the 4 level data file created above. In the experiments
reported here, two identical smoothing operators (7x7 averaging masks) were applied in sequence to
the 4 level data file. The result of this operation is shown in Figure 10. This operation closes
small gaps in the local minima drainage network while reducing noise. After smoothing, a global
threshold, set at 1.5, is applied to extract a possible drainage network (Figure 11).
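
The blending, smoothing, and thresholding sequence can be sketched as follows. The weights (1 and 2), the mask size (7x7), the two passes, and the threshold (1.5) come from the text. The border handling (averaging over the part of the mask that lies inside the image) and the use of a strict "greater than" comparison are assumptions, as are the function names.

```python
def blend(threshold_area, minima, area_weight=1, minima_weight=2):
    """Weighted sum of the two binary files, giving levels 0, 1, 2, and 3."""
    return [[area_weight * a + minima_weight * m for a, m in zip(row_a, row_m)]
            for row_a, row_m in zip(threshold_area, minima)]


def box_smooth(image, size=7):
    """Averaging mask; the window is clipped at the image border (assumption)."""
    half = size // 2
    rows, cols = len(image), len(image[0])
    out = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            window = [image[rr][cc]
                      for rr in range(max(0, r - half), min(rows, r + half + 1))
                      for cc in range(max(0, c - half), min(cols, c + half + 1))]
            out[r][c] = sum(window) / len(window)
    return out


def extract_network(threshold_area, minima, size=7, passes=2, threshold=1.5):
    """Blend, smooth `passes` times, then apply the global threshold."""
    image = blend(threshold_area, minima)
    for _ in range(passes):
        image = box_smooth(image, size)
    return [[1 if value > threshold else 0 for value in row] for row in image]


# An isolated single-pixel detection is averaged away and removed as noise.
speck = [[1 if (r, c) == (2, 2) else 0 for c in range(5)] for r in range(5)]
cleaned = extract_network(speck, speck)
```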

H. Thinning

The output of section G above is a possible drainage network that is more than one pixel
wide. A thinning algorithm can be applied to find a one-pixel-wide network (Figure 12). At this
stage there might exist certain isolated pixels or segments that are created as a result of the
thinning algorithm. These sporadic pixels can be removed by a simple length thresholding filter.
The thinned drainage network of Figure 12 has been overlaid on the original data in Figure 13.

I.  Direction  of  Flow 

With a nearly complete drainage network, several other processing steps may be carried out
to extract more information from the DTED file. One of these is the determination of the drainage
flow direction. Some approaches to this have been investigated as part of this program, but
complete methods and code have not been developed because of time limitations. The next several
paragraphs describe one such approach.

The input to this stage of processing could come from one of three different steps described
earlier.

First, the thinned drainage network of part H could be used. As the final result of
the system tested thus far, it is the obvious choice, but the shortcomings of the Khoros thinning
algorithm, especially in connectivity, could create problems in finding the direction of flow. If
a new thinning algorithm were developed, this would be an excellent place to begin.

Second, the wide drainage network of part G could be used. To do this, the direction of
flow algorithm could be given two tasks: first, to find the flow direction of each pixel in the
wide network and, second, to use this information to thin the resulting network where needed. The
result may give much more insight than the one-pixel drainage system of step H, by actually
allowing rivers to have a width. This process is discussed further below.

Third, if a wide network can be used and then thinned based on direction of flow, it is
conceivable to use the output of the variable threshold binarization as a starting point. This
would cut several steps of processing, including the time-consuming smoothing steps.

The direction of flow algorithm is described here in terms of the second input
option; however, the methods presented could be adapted to meet the demands of the other
two input cases. The input image in all three cases is binary, with network pixels equal to 1
and all other pixels equal to 0.

1) The first step of this approach is to multiply the binary drainage network file by the
original DTED file pixel by pixel. This will weight each pixel in the drainage network by its
elevation value.

2) Create a mask image describing the connectivity of the above network. In this
file all non-zero pixels contained in a connected drainage path are assigned to a numbered
cluster. The number of clusters will be the number of individual drainage networks in the test
area. This step is not too difficult to construct and is almost complete. It involves searching
the input file pixel by pixel until a non-zero pixel is reached; the location of this pixel is then
stored. Once the first non-zero pixel is reached, all other non-zero pixels that are connected to
it, whether directly as a neighbor pixel or through a complicated string of non-zero pixels, are
stored in the connection mask and assigned to a cluster. After all the pixels for that network
have been identified, a file search for another non-zero pixel begins one pixel after the location
of the first pixel in the last cluster. This time the search will not only look for non-zero
pixels but also check whether the pixel has already been assigned to a cluster, to avoid repeated
clustering. The search continues to the last pixel in the file, ensuring that all possible
networks have been found.

3) With the clustering of networks, a length threshold can easily be used to remove
insignificant or erroneous drainage paths.

4) Create a pixel by pixel direction mask by comparing the elevation of each pixel
with its immediate neighbors and selecting the neighbor pixel with the greatest absolute difference
as the direction. A more elaborate method is to take a weighted average of all possible directions,
but this is more complicated and may not be necessary.

5) Perform thinning and path check in one (iterative) step. Test each direction against
the input file. If the direction of the test pixel points from a zero pixel on one side to a
non-zero pixel on the other, delete it from the network. If the direction of the test pixel points
from a non-zero pixel to a zero pixel, change the non-zero pixel to 1. If the direction of the test
pixel points from one non-zero pixel to another, do nothing. This procedure will delete some of the
network at the higher elevations, but in those regions the drainage network will not be visible in
most other mapping media. Clustering and directions will need to be recalculated after each
iteration. It is uncertain how many iterations will be sufficient to obtain the correct network.

6) Use the pixel directions to obtain a global sense of drainage direction. This will be
done by averaging the directions along each path.

This  approach  is  untested  and  may  need  to  be  revised. 
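
Steps 1 through 4 of the list above can be sketched as follows. Like the approach itself, this is an untested outline under stated assumptions: pixels are treated as connected when they touch along an edge or a corner (8-connectivity), the "length" of a cluster is taken to be its pixel count, and ties in the direction mask go to the first neighbor examined. All function names are hypothetical.

```python
from collections import deque

NEIGHBORS = [(-1, -1), (-1, 0), (-1, 1), (0, -1), (0, 1), (1, -1), (1, 0), (1, 1)]


def weight_by_elevation(network, elevation):
    """Step 1: multiply the binary network by the elevation data, pixel by pixel."""
    return [[n * e for n, e in zip(row_n, row_e)]
            for row_n, row_e in zip(network, elevation)]


def label_clusters(network):
    """Step 2: raster scan; flood-fill each unlabeled non-zero pixel into a cluster."""
    rows, cols = len(network), len(network[0])
    labels = [[0] * cols for _ in range(rows)]
    sizes = {}
    for r in range(rows):
        for c in range(cols):
            if network[r][c] == 0 or labels[r][c] != 0:
                continue  # background, or already assigned to a cluster
            label = len(sizes) + 1
            labels[r][c] = label
            queue, size = deque([(r, c)]), 0
            while queue:
                cr, cc = queue.popleft()
                size += 1
                for dr, dc in NEIGHBORS:
                    nr, nc = cr + dr, cc + dc
                    if (0 <= nr < rows and 0 <= nc < cols
                            and network[nr][nc] != 0 and labels[nr][nc] == 0):
                        labels[nr][nc] = label
                        queue.append((nr, nc))
            sizes[label] = size
    return labels, sizes


def remove_short_clusters(network, min_pixels):
    """Step 3: drop clusters with fewer than `min_pixels` pixels (assumed length measure)."""
    labels, sizes = label_clusters(network)
    return [[1 if lab and sizes[lab] >= min_pixels else 0 for lab in row]
            for row in labels]


def direction_mask(network, elevation):
    """Step 4: offset (dr, dc) of the neighbor with the greatest absolute difference."""
    rows, cols = len(network), len(network[0])
    directions = [[None] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            if network[r][c] == 0:
                continue
            best, best_diff = None, -1
            for dr, dc in NEIGHBORS:
                nr, nc = r + dr, c + dc
                if 0 <= nr < rows and 0 <= nc < cols:
                    diff = abs(elevation[nr][nc] - elevation[r][c])
                    if diff > best_diff:
                        best, best_diff = (dr, dc), diff
            directions[r][c] = best
    return directions


network = [[1, 1, 0, 0], [0, 0, 0, 1], [0, 0, 0, 0]]
labels, sizes = label_clusters(network)
```

Note that the greatest absolute difference rule of step 4 does not by itself distinguish uphill from downhill; the sign of the elevation difference would have to be kept if a true downstream direction is wanted.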

J. Drainage Vectorization

A complete network or even a partial network can be represented as a set of connected or
disconnected vectors. Each network of sufficient length can be simplified to a set of vectors that
follow its path. The resulting set of vectors may be useful in registration of satellite imagery
that has a perpendicular frame of reference. The relative vector lengths and angles should be
similar in the DTED and the imagery, allowing the registration process to overcome rotational,
translational and scaling problems.

III. CONCLUSION


A straightforward technique has been presented for automatic extraction of drainage
networks from Digital Terrain Elevation Data (DTED) of the DMA. It has been shown that an
appropriately assembled set of commonly used image processing techniques is able to extract
drainage networks from elevation data. However, there are other issues that need to be addressed
before the proposed methodology can be adopted as a robust and reliable tool for the task. Among
these issues are determination of the direction of flow, testing of the methodology with elevation
data from different geographical areas, testing of the accuracy by comparing the extracted drainage
networks with ground truth data, and refinement and optimization of the parameters used in the
methodology. A neural network approach could also have been investigated and devised for the
task; however, due to lack of training data, the implementation was not feasible at this stage. In
addition to their many other useful applications, drainage networks can also be used for automatic
registration of unregistered images, because they are unique features of elevation data
that can be used effectively for registration purposes. Expansion, modification, and utilization of
the developed methodology will be addressed in future research proposals.

Figure 2. A three-dimensional plot of the sample elevation data.

Figure 3. A gray level representation of the sample elevation data.

Figure 4. The result of a high pass filter on the sample elevation data.

Figure 5. The result of histogram equalization on the data of Figure 4.

Figure 8. A binary representation of the local minima of the sample elevation data.

Figure 9. A four level image created from the general drainage area and the local minima.

Figure 11. The result of a global thresholding on the data of Figure 10.

Figure 12. The result of thinning of the data of Figure 10, a possible drainage network.


INFRARED IMAGES OF ELECTROMAGNETIC FIELDS

John  D.  Norgard 
Professor/ECE 

Department  of  Electrical  &  Computer  Engineering 
University  of  Colorado 

Abstract 

A new infrared (IR) imaging technique is being developed to measure two-dimensional
electromagnetic (EM) field distributions near a radiator or a scattering body. Both electric and
magnetic field intensities can be measured with this technique. Initial tests to prove the
validity, accuracy and sensitivity of this new technique were performed this summer in the anechoic
chamber at the Electromagnetic Vulnerability Analysis Facility (EMVAF) at Rome Laboratory (RL).

External radiation tests were performed on a conducting cylinder irradiated by an incident plane
wave. E- and H-field patterns of the scattered/diffracted energy from the cylinder were measured.
Tests were conducted at various microwave frequencies relative to the resonant frequencies
associated with the cylinder, at numerous angles of incidence (e.g. end-on, broad-side,
oblique-incidence), and at several different polarizations of the incident field relative to the
axis of the cylinder (e.g. horizontal, vertical, and skewed). Tests were performed in the near and
far fields of the cylinder.

Internal coupling tests were performed to determine the induced energy coupled through long,
thin slot apertures at various orientations in the side of the cylinder (e.g. axial and
longitudinal slots).

The  scattering  characteristics  of  several  dielectric  cylinders  and  spheres  were  also  tested. 
Metallic  and  dielectric  half-cylinders  and  half-spheres  were  tested  for  their  scattering  characteristics. 

The  radiation  pattern  of  a  standard  gain  horn  (aperture  antenna)  was  determined  using  the  IR 
imaging  technique.  Radiation  patterns  in  the  E  and  H  planes  were  taken  in  the  near  and  far  fields  of 
the  horn  aperture.  Several  cross-sectional  patterns  were  also  taken  in  the  near  field  of  the  horn  and 
in  the  aperture  plane. 

These  simple,  canonical  objects  were  tested  to  validate  the  accuracy  of  the  IR  technique.  In 
all  validation  tests,  the  relative  error  was  typically  only  7  to  8  percent.  Therefore,  this  new  EM 
measurement  technique,  if  further  developed,  has  the  potential  to  provide  extremely  accurate  EM  field 
measurements. 

Three papers were presented at international IR and EM conferences this summer; three
seminars were also presented on the technique. One paper has been submitted to an IR journal for
publication; three new papers have been submitted for presentation at several IR conferences next
year.

INFRARED IMAGES OF ELECTROMAGNETIC FIELDS
John D. Norgard

An infrared (IR) measurement technique is being developed to measure electromagnetic (EM)
fields. This technique uses a minimally perturbing, thin, planar IR detection screen to produce a
thermal map of the intensity of the EM energy over a two-dimensional region. EM fields near
radiating microwave sources and scattering bodies can be measured with this technique. This
technique also can be used to correlate theoretical data with experimental observations and to
experimentally validate complicated numerical codes which predict electric field distributions
inside waveguide cavities (E-Fields) and surface current distributions on metallic surfaces
(H-Fields).

1. Introduction

A  non-destructive,  minimally  perturbing  IR  measurement  technique  is  being  developed  to 
observe  EM  fields.  Metallic  surface  currents  and  charges  also  can  be  measured  with  this  technique. 

This  IR  measurement  technique  produces  a  two-dimensional  IR  thermogram  of  the  electric  or 
magnetic  field  being  measured,  i.e.  a  two-dimensional  isothermal  contour  map  or  a  gray  scale  of  the 
intensity  of  the  EM  field. 

Electric  and  magnetic  fields  can  be  measured  separately.  For  example,  electric  field  patterns 
radiated  from  microwave  horn  antennas,  electric  field  intensities  coupled  through  apertures  in  shielded 
enclosures,  diffraction  patterns  of  electric  fields  scattered  from  complicated  metallic  objects,  and 
electric  field  modal  distributions  induced  inside  cylindrical  waveguide  cavities  can  be  measured.  Also, 
magnetic  field  distributions  near  conductive  surfaces  and  induced  surface  currents  on  metallic  surfaces 
can  be  determined. 

The advantages and disadvantages of this new IR measurement technique are discussed later.

2. IR Measurement Technique (Overview)

The IR measurement technique is based on the Joule heating that occurs in a lossy material
as an EM wave passes through the material. A thin, planar sheet of lossy carbon loaded paper can be
used to map electric fields; a thin, planar sheet of lossy ferrite loaded epoxy can be used to map
magnetic fields. In either situation, the absorbed heat energy is converted into conducted and
convected heat energy and into re-radiated EM energy. The radiated EM energy is concentrated in the
IR band. This "black body" energy can be detected with an IR (Scanning) Array or with an IR
(Staring) Focal Plane Array (FPA).


2.1. IR Experimental Setup

This  technique  involves  placing  a  lossy  non-perturbing  IR  detection  screen  in  the  plane  over 
which  the  EM  fields  are  to  be  measured. 

2.1.1.  Electromagnetic  Parameters 

The detector screen can be made from a thin sheet of linear, homogeneous and isotropic but
lossy material. From the complex form of Poynting's Theorem for a linear, homogeneous and isotropic
material, the absorbed power $P_{abs}$ within a given volume V of the lossy material is a function
of the electric (E) and magnetic (H) field intensities inside the screen and is given by

$$P_{abs} = \int_V \left( \sigma E^2 + \omega \epsilon'' E^2 + \omega \mu'' H^2 \right) dV \quad [\mathrm{W/m^2}] \tag{1}$$

where $\sigma$ is the conductivity of the detector screen, $\epsilon''$ is the imaginary component
of the permittivity of the detector screen, $\mu''$ is the imaginary component of the permeability
of the detector screen, and $\omega$ is the radian frequency of the incident field. The volume
integral is over the illuminated portion of the detector screen. The spectral characteristics of
the complex constitutive parameters ($\mu$, $\epsilon$, $\sigma$) of the detector material must be
known (or measured) over the entire frequency bandwidth to be measured.

The incident EM energy is absorbed by the lossy material and is converted into heat
energy, which causes the temperature of the detector material to rise above the ambient temperature
of the surrounding background environment by an amount which is proportional to the local electric
and/or magnetic field intensity (energy) at each point (pixel) in the screen material. In regions
where the fields are strong, the absorbed energy is large and the resulting pixel temperatures are
high; in regions where the fields are weak, the absorbed energy is small and the resulting pixel
temperatures are low. The resulting two-dimensional temperature distribution over the surface of
the screen can be detected, digitized and stored in the memory of an IR camera. The temperature
distribution on the surface of the screen without any EM energy incident on the screen also can be
stored in the memory of the IR camera as an ambient background reference temperature distribution.
The difference in the temperature distributions at each pixel (between the illuminated and the
non-illuminated screen) is due to the effects of the electric or magnetic field incident on the
screen at that pixel location.

The EM effects can be visualized by presenting the differenced two-dimensional temperature
profile as a false color image, where cool colors (for example shades of blue) represent weak areas
of EM energy and hot colors (for example shades of red) represent strong areas of EM energy. The
resulting two-dimensional false color image is called an IR thermogram, i.e. an iso-temperature
contour map, and is a representation of the electric and/or magnetic field distribution passing
through the screen material.
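
The background subtraction and false color display can be sketched as follows. The pixel-by-pixel difference follows the description above; the particular blue-to-red color ramp is an illustrative assumption and is not the palette of the IR camera used in the tests. The function names are hypothetical.

```python
def difference_thermogram(illuminated, ambient):
    """Subtract the stored ambient reference from the illuminated image, pixel by pixel."""
    return [[hot - ref for hot, ref in zip(row_i, row_a)]
            for row_i, row_a in zip(illuminated, ambient)]


def false_color(delta_t, max_delta_t):
    """Map a temperature rise onto an (R, G, B) triple: blue for weak, red for strong."""
    if max_delta_t <= 0:
        return (0, 0, 255)
    fraction = min(max(delta_t / max_delta_t, 0.0), 1.0)
    red = round(255 * fraction)
    return (red, 0, 255 - red)


# Hypothetical screen temperatures in degrees, with and without the EM source.
illuminated = [[21.5, 22.0]]
ambient = [[21.0, 21.0]]
rise = difference_thermogram(illuminated, ambient)
colors = [[false_color(dt, 1.0) for dt in row] for row in rise]
```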

2.1.2.  Thermal  Parameters 

For a planar sheet of detector screen supported by a block of non-conducting material, e.g. a
styrofoam block, the thermal problem reduces to considering only the radiative and convective heat
losses from the surface of the detector material.

The convective heat loss $h_c$ is approximated by

[W/m^2]  (2)

where $h_0$ varies between 1.4 and 1.6. The radiative heat loss $h_r$ is approximated by

$$h_r = \epsilon_{ir} \sigma_{ir} \left( T^4 - T_{amb}^4 \right) \tag{3}$$

where $\epsilon_{ir}$ is the detector surface emissivity, $\sigma_{ir}$ is the Stefan-Boltzmann
constant in W/m^2 - K^4, and the temperatures are in kelvins. The conductive heat loss is
negligibly small.

2.1.3.  Thermal  Equilibrium 

The  heat  transfer  problem  in  the  detector  material  involves  solving  a  non-linear,  second  order 
differential  equation  in  both  space  and  time,  while  considering  radiative  and  convective  heat  losses 
from  the  surface  of  the  material,  conductive  heat  transfer  within  the  material,  and  the  EM  power 
absorbed  in  the  material  as  a  function  of  distance  into  the  material.  For  the  case  of  the  thin  screens 
considered  here,  the  temperature  is  initially  considered  to  be  constant  in  the  direction  normal  to  the 
surface  of  the  material,  so  that  the  conductive  term  normal  to  the  surface  of  the  screen  can  be  ignored 
and  the  power  absorbed  can  be  considered  independent  of  the  direction  normal  to  the  surface  of  the 
screen.  Also,  the  time  dependence  of  the  absorbed  heat  energy  is  ignored  for  the  steady-state 
solution. 

Relating the convective and radiative heat losses in equations (2) and (3) to the absorbed power
in equation (1) results in the following energy balance at thermal equilibrium:

$$P_{abs} = h_c + h_r \tag{4}$$

where $h_c$ and $h_r$ are the convective and radiative heat losses, respectively. For a properly
optimized detector screen, thermal equilibrium is achieved in just a few seconds.

This non-linear thermal/electrical equation can be solved for the electric or magnetic field, as
a function of the material temperature T, using approximate techniques.

2.2.  Approximate  Solution 

Equation (4) is highly non-linear for large temperature variations above ambient, due to the
thermal processes of convection and radiation. However, for small temperature variations of only a
few degrees above ambient, equation (4) can be linearized for small incremental temperature changes
$\Delta T = T - T_{amb}$ above the ambient temperature $T_{amb}$.

This condition of small temperature variations above ambient is a desirable operational
constraint, since this is also the requirement for small absorption of the EM energy passing
through the screen, which equates to a small perturbation of the incident field when performing the
measurement. For this minimally perturbing measurement case, an almost direct linear correlation
exists between the incremental surface temperature $\Delta T$ and the absorbed electric or magnetic
field intensity. The incident electric or magnetic field can then be determined from a solution of
Maxwell's Equations (Fresnel's and Snell's laws) for an EM wave incident on a planar film of lossy
material. Therefore, it is possible to correlate local surface temperature variations $\Delta T$ to
E or H field intensities.
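
The quality of the small-signal linearization can be checked numerically for the radiative term alone. The sketch below assumes a radiative loss of the Stefan-Boltzmann form (emissivity times the Stefan-Boltzmann constant times the difference of the fourth powers of the screen and ambient temperatures) and compares it with its first order expansion about the ambient temperature. The emissivity of 0.9 and the ambient temperature are hypothetical values, and the convective term is deliberately left out.

```python
STEFAN_BOLTZMANN = 5.670374419e-8  # W / (m^2 K^4)


def radiative_loss(temperature, ambient, emissivity):
    """Net radiated power per unit area, temperatures in kelvins."""
    return emissivity * STEFAN_BOLTZMANN * (temperature ** 4 - ambient ** 4)


def linearized_radiative_loss(delta_t, ambient, emissivity):
    """First order expansion about ambient: 4 * emissivity * sigma * T_amb^3 * delta_t."""
    return 4.0 * emissivity * STEFAN_BOLTZMANN * ambient ** 3 * delta_t


ambient = 293.15    # hypothetical ambient temperature, K
emissivity = 0.9    # hypothetical screen emissivity
delta_t = 2.0       # a rise of a few degrees, as recommended in the text

exact = radiative_loss(ambient + delta_t, ambient, emissivity)
approx = linearized_radiative_loss(delta_t, ambient, emissivity)
relative_error = (exact - approx) / exact
```

For a rise of two degrees the linear estimate falls short of the exact radiative loss by roughly one percent, and the shortfall grows in proportion to the temperature rise, which is consistent with restricting the technique to small temperature variations.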

With the FPA camera available this summer at Rome Laboratory for making the IR thermograms,
temperature differences $\Delta T$ as small as 0.01 degrees could be detected.

Care  is  exercised,  therefore,  in  the  selection  of  the  screen  material  not  to  significantly  perturb 
the  electric  or  magnetic  field  by  the  presence  of  the  lossy  material.  The  screen  can  be  designed  to 
absorb from 1% to 5% of the incident power and to produce a temperature change of less than a few
degrees.  The  constitutive  parameters  of  the  IR  detector  screen  can  be  optimized  to  produce  a  large 
temperature  rise  in  the  detector  material  for  a  small  amount  of  absorbed  energy. 

Electric  and  magnetic  fields  produced  by  Continuous  Wave  (CW)  sources  operated  in  the 
sinusoidal  steady-state  mode  are  easy  to  measure  because  of  the  large  amount  of  energy  contained 
in  the  wave.  Transients  produced  by  high  power  microwave  (HPM)  pulsed  sources,  especially 
repetitively  pulsed  sources,  also  can  be  measured,  if  the  average  energy  content  in  the  pulse  is  high 
enough  to  raise  the  temperature  of  the  detector  screen  material  above  the  minimum  temperature 
sensitivity  of  the  IR  camera.  The  thermal  mass  of  the  detector  material  should  hold  the  absorbed  heat 
energy  long  enough  to  capture  the  IR  thermogram  of  the  pulse. 

2.3.  IR  Detector  Screen 

Referring to equation (1), the screen material can be tailored to respond to only one component
of the field; e.g., by optimizing the values of the electrical conductivity σ and the imaginary part of the
permittivity ε″ of the material relative to the imaginary part of the permeability μ″ of the material, the
detector screen can be made sensitive either to the tangential component of the electric field or to the
tangential component of the magnetic field in the plane of the screen.

For example, an electric field detector screen can be constructed either
i) from a lossy material with a high conductivity σ and a low imaginary permittivity ε″ and a low
imaginary permeability μ″

or ii) from an electrically polarizable material with a high imaginary permittivity ε″ and a low
conductivity σ and a low imaginary permeability μ″.

Alternatively, a magnetic field detector screen can be constructed from a magnetically
polarizable (magnetizable) material with a high imaginary permeability μ″ and a low conductivity σ and
a low imaginary permittivity ε″.

The  optimization  of  the  thermal  and  electrical  parameters  of  the  detection  screen  material 
should  be  guided  by  a  thermal/electrical  computer  code  based  on  a  plane  wave  normally  incident  on 
a  planar  interface  between  air  and  the  lossy  detector  material.  Other  absorptive  and  re-emittive 
transducing  materials  have  been  studied  for  use  as  passive  thermal  screens  for  IR  thermograms. 

2.3.1  Electric  Field  Detector  Screen 

For  the  detection  of  electric  fields,  the  IR  detector  screen  can  be  made  from  a  planar  sheet  of 
lossy thin-film material. Several different detector screens were used to make electric field
thermograms. One was a carbon-loaded thin film (e.g., Teledeltos paper) 80 μm thick with a
conductivity of 8 mhos per meter. The other screens were made from carbon-loaded Kapton films.
The films were loaded to give different resistivities per square. These materials are non-polarizable and
non-conducting; therefore, the imaginary components of the permittivity ε″ and the permeability μ″ are
negligibly  small.  For  these  non-conducting,  non-polarizable,  non-magnetic  screen  materials,  maximum 
heating  occurs  due  to  the  electric  field  and  negligible  heating  occurs  due  to  the  magnetic  field. 

For  plane  waves  normally  incident  on  this  carbon  loaded  electric  field  detection  screen  and  for 
an IR FPA with a temperature sensitivity of 0.01 K, electric fields with a magnitude on the order of
61.4 V/m (1 mW/cm² of incident power) could be detected. This result was obtained empirically from
experimental  data  in  which  the  incident  power  level  was  incrementally  decreased  until  no  electric  field 
contour  lines  on  the  IR  thermogram  were  discernable  from  the  ambient  background  level. 
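
The correspondence quoted above between 1 mW/cm² and 61.4 V/m can be checked with a short calculation. The sketch below assumes a plane wave in free space and an rms field value, with the free-space impedance taken as 376.73 ohms; the function name is illustrative and not part of the original work.

```python
import math

ETA_0 = 376.73  # impedance of free space in ohms (assumed propagation medium)

def rms_field_from_power_density(s_mw_per_cm2: float) -> float:
    """Return the rms electric field (V/m) of a plane wave of given power density."""
    s_w_per_m2 = s_mw_per_cm2 * 10.0  # 1 mW/cm^2 equals 10 W/m^2
    return math.sqrt(s_w_per_m2 * ETA_0)

e_min = rms_field_from_power_density(1.0)
print(f"{e_min:.1f} V/m")
```

Under these assumptions the result agrees with the 61.4 V/m figure, which suggests that the quoted field magnitude is an rms value.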

2.3.2  Magnetic  Field  Detector  Screen 

For the detection of magnetic fields, an IR detection screen can be fabricated using a
ferrite-loaded epoxy. Several different ferrite powders have been tested, viz. Nickel Zinc Copper Ferrite, Iron
Silicide Ferrite, Ferrite 50, etc. The best detector screen used to make magnetic field thermograms
had an 80% by weight mixture of Ferrite 50 with a thickness of 0.5 mm and an imaginary relative
permeability of approximately 2. The imaginary component of the permittivity ε″ and the
conductivity σ were negligibly small. For this magnetic, non-conducting, non-electric screen material,
maximum  heating  occurs  due  to  the  magnetic  field  and  negligible  heating  occurs  due  to  the  electric 
field. 

In  the  fabrication  of  a  magnetic  screen,  it  is  difficult  to  keep  the  electric  polarizability  of  the 
epoxy base material from contributing to the imaginary component of the permittivity ε″ of the
composite and, thereby, introducing additional electric polarization losses. If the electric properties of
the ferrite are not carefully controlled and minimized, the absorbed power will be due to both the
electric and magnetic components of the EM field, as given by equation (1). In this case, it is difficult,
if  not  impossible,  to  separate  the  two  coupling  mechanisms  from  each  other  and  to  calibrate  the 
technique. 

To  detect  surface  currents  on  a  metallic  structure,  the  magnetic  detection  screen  can  be  placed 
in  direct  contact  with  the  surface  of  the  metal.  The  thickness  of  the  detector  screen  must  be  kept  to 
a  minimum,  so  that  the  magnetic  field  that  is  measured  on  the  outer  surface  of  the  screen  is  a  direct 
reflection  of  the  magnetic  field  on  the  inner  surface  of  the  screen  in  contact  with  the  surface  of  the 
metal.  The  material  can  be  held  flat  against  the  surface  of  the  metal  using  a  non-conducting  glue. 

The  surface  current  is  related  to  the  tangential  component  of  the  magnetic  field  intensity  on 
the  surface  of  the  metal  by  the  magnetic  boundary  condition  on  a  perfect  conductor: 

$$\hat{n} \times \mathbf{H} = \mathbf{J}_s,$$

where  n  is  the  normal  to  the  surface  S.  The  magnetic  field  is  perpendicular  to  the  direction  of  the 
surface  current. 
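
As a small illustration of this boundary condition, the sketch below evaluates the cross product of a unit normal with a tangential magnetic field. The vectors and the function names are hypothetical values chosen for the example, not measured data.

```python
from typing import Tuple

Vec3 = Tuple[float, float, float]

def cross(a: Vec3, b: Vec3) -> Vec3:
    """Right-handed cross product a x b."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def surface_current(normal: Vec3, h_tangential: Vec3) -> Vec3:
    """Surface current density on a perfect conductor: n x H."""
    return cross(normal, h_tangential)

n_hat = (0.0, 0.0, 1.0)   # outward normal (assumed along z)
h_field = (0.0, 1.0, 0.0)  # tangential H in A/m (assumed along y)
j_s = surface_current(n_hat, h_field)
dot = sum(j * h for j, h in zip(j_s, h_field))
print(j_s, dot)
```

The zero dot product reflects the statement above that the magnetic field is perpendicular to the direction of the surface current.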

2.4  IR  Camera 

The  temperature  difference  between  the  screen  material  and  the  background  is  detected, 
digitized,  and  can  be  stored  in  the  memory  of  an  IR  camera  on  a  pixel  by  pixel  basis.  The  IR  imaging 
system  used  to  take  the  IR  thermograms  has  approximately  200  by  100  pixels  per  frame  of  data.  The 
detector is scanned over the image. The detector is a mercury-cadmium-telluride (HgCdTe) IR
photodetector operated in a photovoltaic mode. The detector operates at liquid nitrogen temperatures. The
camera can detect temperature differences of approximately 0.01 K, and has a relative accuracy of
plus or minus 10% when detecting EM fields.

2.5  IR  Images 

The  stored  IR  thermogram  data  represents  the  temperature  distribution  over  the  extent  of  the 
detector  screen  and  is  a  map  of  the  intensity  of  the  electric  or  magnetic  field  distribution  absorbed  in 
the  screen.  For  small  temperature  rises  less  than  a  few  degrees  above  ambient,  the  electric  and 
magnetic  field  intensities  are  nearly  linearly  proportional  to  the  temperature  change. 

2.5.1  Spatial  Resolution 

The  spatial  resolution  of  the  IR  thermogram  is  a  function  of  the  number  of  pixel  elements  in  the 
IR  camera,  and  is  fixed  by  the  angular  resolution  of  the  telephoto,  regular  or  wide  angle  lens  used  on 
the  IR  camera  when  making  the  IR  image.  A  telephoto  lens  can  be  used  to  look  at  small  details  in  the 
field  structure  on  the  detector  screen;  a  wide  angle  lens  can  be  used  to  look  at  large  scale  trends  in 
the  field  structure  in  the  detector  screen. 

The  telephoto  lens  also  has  the  added  advantage  for  regular  field  mapping  applications  of 
allowing  the  IR  camera  to  be  located  far  away  from  the  object  under  test  and,  thus,  removing  any 
perturbing  effects  that  the  metallic  structure  of  the  camera  might  have  on  the  field  distribution  being 
measured. 

2.5.2  Thermal  Resolution 

The  thermal  resolution  of  the  IR  thermogram  is  a  function  of  the  digitizer  in  the  IR  camera.  The 
IR camera used to take the IR thermograms has a 12 bit digitizer. For a 12 bit digitizer, the
temperature range seen by the IR camera is divided into 256 increments. Each digitized increment is
assigned a unique color, resulting in a temperature resolution of 256 color levels.

2.5.3  Polarization  Dependence 

The  magnitude  of  the  total  electric  or  magnetic  field  at  each  point  on  the  detector  screen  can 
be measured by the IR technique; however, no phase information is available. Both polarizations in the
plane of the screen are combined to produce the total magnitude of the field. One polarization can be
measured if the detector screen is striated into a large number of thin, parallel strips of lossy material
each separated by a lossless material, e.g., air. Only the component of the field in the direction of the
strips will be measured with this detector material configuration. The striated detector screen can
be rotated 90° to pick up both components of the field in the plane of the screen.

2.5.4  Thermogram  Errors 

The  resulting  IR  image  of  an  EM  field  depends  on  the  combined  EM  and  thermal  properties  of 
the  detector  material  and  is  subject  to  several  significant,  but  controllable,  errors. 

2.5.4.1 Lateral Conduction Effects

Conductive  heating  in  the  transverse  direction  within  the  screen  material  causes  thermal 
"bleeding"  from  the  hot  spots  on  the  screen  to  nearby  cold  spots.  This  thermal  bleeding  tends  to  fill 
in the nulls (minimums) somewhat, whereas the areas of maximum heating (peaks) are not affected
very much by this effect. This effect can be minimized by operating at small temperature variations
above  ambient. 

2.5.4.2 Lateral Convective Effects

Convective  heating  of  the  top  of  the  screen  from  heat  rising  from  the  bottom  of  the  screen 
causes  the  top  of  the  screen  to  appear  slightly  hotter  than  the  bottom  of  the  screen.  This  "blurring" 
of  the  image  can  be  kept  to  a  minimum  by  operating  at  small  temperature  variations  above  ambient. 

This blurring effect can be eliminated completely by placing the IR detector screen in a
horizontal  position  and  observing  the  image  with  the  IR  camera  looking  down  on  the  screen  from  above 
or  from  the  side  using  an  IR  mirror. 

2.5.5  IR  Measurement  Accuracy 

The  accuracy  of  the  IR  measurement  technique  can  be  demonstrated  by  performing  two  simple 
experiments with known theoretical solutions. In one experiment, as shown in figure 1, the diffraction
pattern from a "Lloyd's mirror" was measured. In the other experiment, as shown in figure 2, the
diffraction pattern from a "knife edge" conducting half-plane was measured.

In  the  Lloyd's  mirror  experiment,  the  resulting  diffraction  pattern  is  due  to  the  antenna 
interfering  with  its  image  in  a  large  ground  plane.  The  second  peak  in  figure  1  is  smaller  in  magnitude 
than  the  first  peak  due  to  the  fact  that  the  radiation  pattern  of  the  horn  antenna  used  in  the  experiment 
decreases with angle from the boresight direction of the horn. The near field (Fresnel zone) antenna
pattern of the horn antenna was used to obtain the theoretical results.

In  both  experiments,  the  screen  material  was  optimized  to  measure  only  the  tangential 
component  of  the  electric  field  intensity  in  the  plane  of  the  screen. 

These experiments were performed in an anechoic chamber, as shown in figures 1a and 2a.
Good correlations between theory and experiment were obtained, as shown in figures 1b and 2b. The
worst errors occurred in the minimums (deep nulls) of the diffraction patterns, where thermal bleeding
from the surrounding hot areas tended to obscure the real depth of the minimums. Some thermal
bleeding  out  of  the  maximums  into  the  surrounding  areas  also  occurred,  obscuring  the  real  height  of 
the  maximums. 

Even  with  conductive  bleeding  and  convective  blurring  of  the  image,  the  measurement  error 
is  less  than  approximately  10%  under  controlled  test  conditions. 

2.6  IR  Advantages  and  Disadvantages 

The  IR  measurement  technique  provides  a  quick  and  accurate  method  to  observe  EM  fields  in 
a  two-dimensional  plane.  However,  only  the  magnitude  of  the  electric  or  magnetic  field  is  measured; 
no  phase  information  is  detected.  Also,  since  this  technique  is  based  on  the  thermal  mass  of  the 
detector  material,  high  energy  is  required  to  produce  good  thermal  images  of  EM  fields. 

3. IR Thermograms ... Scattering from and Coupling into a Cylinder

As  an  example  of  the  IR  measurement  of  electric  fields,  a  right  circular  cylinder,  containing  a 
long,  thin  slot  aperture  in  its  side,  was  irradiated  with  a  plane  EM  wave.  IR  thermograms  were  made 
of  the  diffraction  pattern  of  the  EM  field  scattered  from  the  cylinder  and  the  cylindrical  modes  excited 
inside  the  cavity.  The  experimental  setup  is  shown  in  figures  3a  and  3b. 

Measurements  on  a  finite  length  cylinder  (3D  images)  and  theoretical  predictions  for  an  infinite 
length  cylinder  (2D  cross-sectional  analyses)  were  made  and  correlated. 

The  experimental  cylinder  is  one  meter  in  length  and  has  an  inner  diameter  of  approximately 
10  centimeters.  The  rectangular  slot  aperture  is  64  millimeters  in  width  and  is  10  centimeters  in 
length.  The  slot  is  located  in  the  middle  of  the  cylinder  and  is  oriented  parallel  to  the  axis  of  the 
cylinder. The cylinder was irradiated with microwave energy at 3 GHz (10 centimeter wavelength)
from  a  pyramidal  horn  antenna,  polarized  in  the  circumferential  direction  of  the  cylinder.  Cylindrical 
TE  modes  were  predominantly  excited  by  the  wave  polarization  inside  the  cavity. 

A  large  IR  detector  screen  of  carbon  paper  was  positioned  in  the  radial  plane  that  intersected 
the middle of the slot aperture. Another IR disk was positioned inside the cylinder in the same
cross-sectional plane.

Experimentally  obtained  IR  thermograms  of  the  results  are  shown  in  figures  4  through  6.  The 
brightness  of  each  color  in  the  image  corresponds  to  the  intensity  of  the  EM  field. 

Thermograms of the incident microwave field of a horn antenna are shown in figure 4. Figure
4a  is  an  image  of  the  electric  field  traveling  wave  in  the  longitudinal  plane  in  front  of  the  horn.  Figure 
4b  is  an  image  of  the  electric  field  in  the  transverse  (cross-sectional)  plane  one  meter  in  front  of  the 
aperture  of  the  horn. 

Thermograms  of  the  field  scattered  from  a  cylinder  are  shown  in  figure  5.  Figure  5a  is  an 
image  of  the  electric  field  standing  wave  created  between  the  horn  and  the  cylinder.  This  thermogram 
shows  the  interference  pattern  between  the  incident  wave  and  the  scattered  cylindrical  wave.  Figure 
5b  is  an  image  of  the  electric  field  traveling  wave  in  the  shadow  zone  of  the  cylinder.  The  null  behind 
the  cylinder  and  the  diffracted  wave  off  the  top  and  bottom  of  the  cylinder  can  be  seen  in  this 
thermogram. 

Thermograms  of  the  induced  modes  coupled  into  the  cylindrical  cavity  are  shown  in  figure  6. 
Figure 6a is an image of the electric field coupled through the aperture for a frequency 10% below
the cutoff frequency of the cylindrical waveguide. The EM energy coupled through the aperture is
visible in this thermogram and, as expected, has the radiation pattern of an electric dipole. The
dominant TE₁₁ waveguide modal pattern is partially developed in the center of the waveguide. Figure
6b is an image of the electric field coupled through the aperture for a frequency 10% above the
cutoff frequency of the cylindrical waveguide. The dominant TE₁₁ waveguide modal pattern is now
fully  developed  in  the  center  of  the  waveguide. 
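
For orientation, the cutoff frequency referred to above can be estimated from the cylinder dimensions. The sketch below is only an estimate under stated assumptions: an ideal, air-filled circular waveguide with the approximately 10 centimeter inner diameter given earlier, ignoring the slot and the finite length. The constant 1.8412 is the first root of the derivative of the Bessel function J1, which sets the cutoff of the dominant mode of a circular guide; the function name is illustrative.

```python
import math

C0 = 299_792_458.0   # speed of light in vacuum, m/s
X11_PRIME = 1.8412   # first root of J1'(x), governing the dominant-mode cutoff

def dominant_mode_cutoff_hz(inner_diameter_m: float) -> float:
    """Cutoff frequency of the dominant mode of an ideal circular waveguide."""
    radius = inner_diameter_m / 2.0
    return X11_PRIME * C0 / (2.0 * math.pi * radius)

fc = dominant_mode_cutoff_hz(0.10)  # assumed 10 cm inner diameter
print(f"cutoff    : {fc / 1e9:.2f} GHz")
print(f"10% below : {0.9 * fc / 1e9:.2f} GHz")
print(f"10% above : {1.1 * fc / 1e9:.2f} GHz")
```

With these assumptions the cutoff comes out near 1.76 GHz, so the two thermograms of figure 6 would correspond to frequencies on either side of that value rather than to the 3 GHz used for the scattering measurements.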

The  diffraction  patterns  of  the  EM  fields  scattered  from  the  cylinder  and  the  modal  patterns  of 
the  induced  cylindrical  cavity  modes  are  clearly  indicated  in  these  figures. 

4.  Conclusion 

The  IR  measurement  technique  is  a  viable  method  to  aid  in  the  determination  of  EM  fields 
scattered  from  complex  metallic  objects  and  the  EM  energy  coupled  into  complex  cavity  structures. 
This  method  is  of  particular  importance  in  the  study  of  scattering  surfaces  with  complicated 
geometrical  shapes,  whose  patterns  of  surface  current  distributions  may  not  be  found  easily  using 
theoretical  methods. 

The  IR  method  allows  for  rapid  observation  of  EM  field  activity  and  interference,  resulting  in 
an  in-depth  understanding  of  the  EM  scattering  and  coupling  phenomena.  Qualitative  and  quantitative 
comparisons  can  be  made  between  the  fields  measured  using  the  thermal  radiation  experimental 
approach and the fields predicted using a theoretical/numerical approach. Experimental and theoretical
data,  therefore,  can  be  easily  correlated  with  this  technique. 

5.  Future  Work 

The  development  of  a  suitable  magnetic  detection  screen  is  in  progress.  The  use  of  spin-on 
magnetic  thin  films  is  being  investigated. 

The  calibration  of  the  IR  thermogram  images  to  determine  the  absolute  magnitude  of  the 
electric  or  magnetic  field  intensity  is  also  in  progress.  Several  empirical  tests  have  been  performed 
with  a  standard  gain  horn  aperture  antenna  to  correlate  the  intensity  of  the  EM  power  of  the  radiated 
wave  incident  on  the  detector  screen  with  the  intensity  of  the  measured  temperature  (color)  of  the 
screen  on  a  pixel  by  pixel  basis. 

It  may  also  be  possible  to  estimate  the  phase  of  the  incident  wave  by  making  a  numerical 
model of the thermal/electromagnetic interaction in the detector screen material, and to use energy
minimization techniques to estimate the phase of the incident wave. This work is being done in
conjunction with Syracuse University (Prof. Tapan Sarkar).

The  imaging  technique  is  also  being  used  to  map  fields  around  a  simple  generic  model  of  an 
aircraft.  The  IR  thermograms  of  the  scattered  microwave  energy  will  be  used  to  verify  the  accuracy 
of  the  numerical  code.  The  particular  code  being  tested  is  the  GEMAC  code  developed  at  Rome 
Laboratory. 

6.  Publications 

The  papers  published  on  this  project  during  the  AFOSR  Summer  Research  Program  are  listed 
in  the  Appendix. 

Fig.  4  IR  thermogram  of  a  microwave  horn  radiation  pattern:  (a)  in  the  longitudinal  plane  (in  a  vertical 
plane  through  the  center  of  the  horn  aperture)  and  (b)  in  the  transverse  plane  (a  cross-sectional  cut 
of  the  microwave  beam  1  m  in  front  of  the  plane  of  the  aperture). 


Fig.  5  Diffraction  pattern  of  the  scattered  electric  field  from  a  finite  cylinder:  (a)  standing  wave  between 
the  horn  and  the  cylinder  (showing  the  constructive  and  destructive  interference  patterns  between  the 
incident  and  the  reflected  waves)  and  (b)  traveling  wave  behind  the  cylinder  (showing  the  shadow 
zone  and  the  surface-diffracted  waves). 


Fig. 6 Excited cylindrical waveguide cavity modes. (The long, thin, horizontal slot aperture is centered
on the left-hand side of the IR thermograms. A round IR detection screen is placed inside the cylinder.
Note the reflection in the waveguide walls.) (a) excitation of an evanescent mode (at a frequency 10%
below the cutoff frequency of the cylindrical waveguide) and (b) excitation of the dominant TE₁₁
cylindrical waveguide mode (at a frequency 10% above the cutoff frequency of the cylindrical waveguide).
Note the dipole pattern at the aperture predicted by Bethe hole coupling models.

ANYTIME INFERENCE AND DECISION METHODS


Michael  Pittarelli 
Associate  Professor 
Computer  Science  Department 
SUNY  Institute  of  Technology 


Abstract 

We  examine  methods  of  decision  making  that  are  able  to  accommodate  limitations  on  both  the 
form  in  which  uncertainty  pertaining  to  a  decision  problem  can  be  realistically  represented  and 
the  amount  of  computing  time  available  before  a  decision  must  be  made.  The  methods  are 
anytime  algorithms  in  the  sense  of  Boddy  and  Dean  [1989].  An  anytime  adaptation  of 
Nilsson's  [1986]  probabilistic  logic  is  developed.  Decision  making  techniques  are  presented  for 
use  with  this  system  of  logic,  with  Frisch  and  Haddawy's  [1992]  anytime  deduction  system, 
and  with  a  probabilistic  database  model. 

1.  Anytime  Algorithms  for  Decision  Making 

An algorithm qualifies as an anytime algorithm [Boddy and Dean, 1989] if, minimally, it is
interruptible in such a way that it provides usable output at any time it is interrupted and if
the expected quality of the output improves in some sense with increasing execution time.
(Additional properties are sometimes assumed, for example, that the marginal improvement in
the quality of the output is nonincreasing.) For a decision problem under risk, where the
probability of each of the conditions is known and is represented as a single real number, an
anytime decision algorithm might consider the actions in some arbitrary sequence and output
at each step the action with highest expected utility among those so far examined. The higher
the expected utility of the action identified as best by a particular time, the better. Clearly, an
action identified as best (among all those examined so far) at a later time will have expected
utility at least as high as one identified as best at an earlier time. (However, the improvement in
expected utility will not, in general, be a nonincreasing function of computing time.)
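
A minimal sketch of such an anytime decision procedure for a problem under risk is given below. The action names, utilities, and probabilities are hypothetical, chosen only to show how the best-so-far answer evolves; the generator can be abandoned after any step and its last output used.

```python
from typing import Iterable, Iterator, Sequence, Tuple

def anytime_best_action(
    actions: Iterable[Tuple[str, Sequence[float]]],
    probs: Sequence[float],
) -> Iterator[Tuple[str, float]]:
    """Yield the best (action, expected utility) found so far after each step."""
    best_name, best_eu = None, float("-inf")
    for name, utilities in actions:
        eu = sum(p * u for p, u in zip(probs, utilities))
        if eu > best_eu:
            best_name, best_eu = name, eu
        yield best_name, best_eu  # usable answer if interrupted here

# Hypothetical problem: two conditions, three actions.
probs = (0.3, 0.7)
actions = [("a1", (0.2, 0.4)), ("a2", (0.8, 0.2)), ("a3", (0.0, 1.0))]
trace = list(anytime_best_action(actions, probs))
for step, (name, eu) in enumerate(trace, start=1):
    print(step, name, round(eu, 2))
```

The expected utility of the reported action never decreases from one step to the next, although the size of each improvement depends entirely on the order in which the actions happen to be examined.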

A  more  realistic  representation  of  uncertainty  is  by  means  of  intervals  of  real  numbers 
[Kyburg, 1992]. A reasonable decision criterion interprets the intervals as linear inequality
constraints determining a set of real-valued probability functions over the conditions and
identifies an action as inadmissible if there does not exist an element of the set relative to
which the action maximizes expected utility [Levi, 1980]. This may be determined for each
action by checking for the existence of a feasible solution to a linear program [Pittarelli, 1991].
Any action not yet ruled out as inadmissible by some time may be classified (possibly
incorrectly) as admissible relative to the current collection of probability intervals. At a higher
level, the set of intervals may be iteratively refined (e.g., for confidence intervals, by
decreasing the confidence level [Loui, 1986]) and the process of testing for inadmissibility
repeated  for  each  refinement,  for  the  elements  of  the  most  recently  computed  admissible  set. 

With this sort of constraint on probabilities over conditions, the fewer actions identified as
E-admissible, the better. As processing continues, tighter constraints reveal certain actions to be
inadmissible. An action ruled out as inadmissible at one step will not be identified as
admissible at any later step. Hence, at later stages, there are fewer actions among which
choice using criteria other than E-admissibility (e.g., maximin, leximin, minimax regret, etc.)
must be made. (Note that both the maximum and the minimum expected utility among
actions in the admissible set decrease with additional computing time.)

A  system  for  "anytime  deduction"  has  been  developed  by  Frisch  and  Haddawy  [1992].  In  this 
system,  an  interval  of  probability  for  a  conclusion  is  determined  from  a  collection  of  premises 
each  of  which  itself  has  an  associated  probability  interval.  The  interval  of  probability  for  the 
conclusion  is  initially  [0,1]  and  is  narrowed  with  each  application  of  a  rule  of  inference.  This 
approach differs from that of [Nilsson, 1986], which requires the construction of a potentially very
large linear program for determining the endpoints of the probability interval for the entailed
sentence.  However,  as  explained  in  detail  later  in  this  report,  it  is  straightforward  to  produce 
an  anytime  inference  technique  by  an  adaptation  of  Nilsson's  methods  that  considers  only 
subsets  of  the  given  set  of  premises.  We  will  discuss  methods  of  anytime  decision  making 
suitable  for  both  forms  of  probabilistic  inference. 

Probabilistic databases [Pittarelli, 1994], that is, collections of contingency tables of joint
frequencies  or  (real-  or  interval-valued)  probabilities  for  finite  variables,  may  also  provide  linear 
constraints  relevant  to  a  decision  problem.  There  is  again  a  tradeoff  between  the  tightness  of 
the  constraint  that  can  be  inferred  and  the  cost  of  doing  the  inference.  We  will  discuss 
methods  of  anytime  decision  making  utilizing  probabilistic  databases  also. 

2.  Anytime  Deduction  and  Decisions 

Frisch  and  Haddawy  [1992]  have  developed  a  system  of  deduction  for  probabilistic  logic  based 
on  inference  rules.  The  rules  may  be  employed  to  compute  increasingly  narrow  probability 
intervals  for  the  conclusion  of  an  argument.  At  any  time,  the  currently  computed  interval  is 
correct  in  the  sense  that  it  contains  the  narrowest  interval  computable  from  the  probabilities 
associated  with  all  of  the  premises.  Thus,  there  is  a  tradeoff  between  the  precision  of  an 
entailed  probability  interval  and  the  time  required  to  compute  it.  This  feature  makes  anytime 
deduction especially suitable for use by "resource-bounded" systems [Horvitz et al., 1989]; as
Frisch and Haddawy point out, however, how to control the time/precision tradeoff depends
on  the  particular  decision  situation  in  which  the  system  finds  itself. 

Frisch and Haddawy's anytime deduction, Nilsson's probabilistic logic (unless maximum
entropy or related techniques are used), and related systems produce probability intervals for
entailed sentences. A criterion applicable to decision problems in which probabilities are given
as intervals and that reduces to standard maximization of expected utility when the intervals
reduce to point values is Levi's E-admissibility criterion [Levi, 1980]: all and only those actions
maximizing expected utility relative to some member of the set of (point valued) probability
functions compatible with (or actually representing [Kyburg, 1992]) current beliefs are
admissible.  (Such  actions  are  also  referred  to  as  Bayes  solutions.) 

We will consider decision problems consisting of a set of actions $A = \{a_1, \ldots, a_m\}$ and a set
of mutually exclusive and exhaustive conditions $C = \{c_1, \ldots, c_n\}$ such that

$$p(c_j \mid a_i) = p(c_j), \quad \text{for all } i, j.$$

(Jeffrey  [1976]  shows  how  a  problem  in  which  conditions  are  not  probabilistically  independent 
of  actions  can  be  converted  to  an  equivalent  problem  in  which  they  are.) 

A family $D$ of subsets of the $(n-1)$-dimensional simplex $P_C$ of all probability distributions
over $C$ may be defined as

$$D = \{D(a_1), \ldots, D(a_m)\},$$

where

$$D(a_i) = \Big\{ p \in P_C \;\Big|\; \sum_{j=1}^{n} p(c_j)\, U(a_i, c_j) \ge \sum_{j=1}^{n} p(c_j)\, U(a_k, c_j),\ k = 1, \ldots, m \Big\}.$$

$D(a_i)$ is the set of probability functions relative to each of which action $a_i$ maximizes expected
utility, and is referred to as the domain of $a_i$ [Starr, 1966]. Each domain is convex; domains
may intersect at faces; and $\bigcup_{a \in A} D(a) = P_C$.
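
Membership of a given distribution in the domain of an action reduces to comparing expected utilities. The sketch below uses a hypothetical utility matrix with three actions and two conditions; the function names and the tolerance are illustrative assumptions.

```python
from typing import List, Sequence

def expected_utility(p: Sequence[float], utilities: Sequence[float]) -> float:
    return sum(pj * uj for pj, uj in zip(p, utilities))

def in_domain(p: Sequence[float], U: Sequence[Sequence[float]], i: int,
              tol: float = 1e-12) -> bool:
    """True if action i maximizes expected utility relative to distribution p."""
    eu_i = expected_utility(p, U[i])
    return all(eu_i >= expected_utility(p, row) - tol for row in U)

def maximizers(p: Sequence[float], U: Sequence[Sequence[float]]) -> List[int]:
    """Indices of all actions whose domain contains p."""
    return [i for i in range(len(U)) if in_domain(p, U, i)]

# Hypothetical utilities: rows are actions, columns are conditions.
U = [[1.0, 0.0],
     [0.0, 1.0],
     [0.6, 0.6]]
print(maximizers((0.9, 0.1), U))
print(maximizers((0.5, 0.5), U))
print(maximizers((0.6, 0.4), U))
```

Every distribution belongs to at least one domain, and a distribution lying on a shared face (the third case) belongs to more than one, in keeping with the properties stated above.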

Suppose the current belief state regarding probabilities of conditions is represented as $K \subseteq P_C$.
An action $a$ is E-admissible if and only if $K \cap D(a) \neq \emptyset$. The number of admissible actions
decreases monotonically with the size of $K$:

$$K_1 \subseteq K_2 \implies K_1 \cap D(a_i) \subseteq K_2 \cap D(a_i).$$

Thus, there will be a tradeoff between the quality of a decision (i.e., the number of
admissible actions among which choice must be made using a criterion other than
E-admissibility) and the computational expense of shrinking $K$. Reduction in the size of $K$ may
be achieved by applying additional inference rules; by adding premises, thereby enlarging the
linear constraint system; by "extending" and "projecting" larger and larger portions of a
probabilistic database; etc. (Of course, the perceived quality of a decision may be enhanced
without this much work: invoke the maximum entropy principle. This approach will be
criticized  below.)  Computing  aimed  at  reducing  the  number  of  admissible  actions  may  be 
interleaved  with  analysis  of  the  type  proposed  by  Horvitz  et  al.  [1989]  to  determine  whether 
the  cost  of  such  computation  exceeds  its  expected  value. 

Consider a decision involving the two actions "Go to the beach" and "Do not go to the beach",
the outcome of which is contingent on the truth or falsity of the probabilistically entailed
sentence "It will rain this afternoon". Utilities of the four possible outcomes are:

U(Go, Rain) = 0,
U(Go, No rain) = 1,
U(Do not, Rain) = 0.8,
U(Do not, No rain) = 0.2.

The  following  propositions  and  associated  probability  intervals  are  available: 

(1) p(Temperature > 85) ∈ [.95, 1]

(2) p(Humidity > 80) ∈ [.95, 1]

(3) p(B. pressure < 30) ∈ [.95, 1]

(4) p((B. pressure < 30 ∧ Humidity > 80) → Rain) ∈ [.65, .95]

(5) p(Temperature > 85 → Rain) ∈ [.4, .6]

(6)  p(Cloudy L Windy) ∈ [.75, 1]

(7)  p(Cloudy B. pressure < 30) ∈ [.5, .8].

Both "Go to the beach" and "Do not go to the beach" have non-empty domains: "Go"
maximizes expected utility when p(Rain) ≤ 0.5; "Do not" does when p(Rain) ≥ 0.5. Neither
can be ruled out at this stage. However, Frisch and Haddawy's probabilistic inference rules
may be applied one at a time to narrow the interval for p(Rain) until a single admissible action
emerges, or it is no longer economical to continue refining (e.g., the last train to the beach is
about to leave) and a choice among the admissible actions must be made using some other
criterion (e.g., choose at random [Elster, 1989], use maximin, maximize expected utility
relative to the midpoint of the probability interval, etc.).
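
The admissibility test for this two-action, two-condition problem can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names and the dictionary layout are hypothetical, and checking the interval endpoints is sufficient only because there is a single rival action, so the difference in expected utility is linear in p(Rain).

```python
# Utilities from the beach example: U[action][condition].
U = {
    "Go":     {"Rain": 0.0, "No rain": 1.0},
    "Do not": {"Rain": 0.8, "No rain": 0.2},
}

def expected_utility(action, p_rain):
    """Expected utility of an action when p(Rain) = p_rain."""
    u = U[action]
    return p_rain * u["Rain"] + (1.0 - p_rain) * u["No rain"]

def e_admissible(lo, hi, eps=1e-12):
    """Actions that maximize expected utility for some p(Rain) in [lo, hi].

    Hypothetical helper for the two-action case only: the gap between the
    two expected utilities is linear in p(Rain), so it attains its maximum
    over the interval at an endpoint.
    """
    admissible = []
    for a in U:
        rivals = [b for b in U if b != a]
        if any(all(expected_utility(a, p) >= expected_utility(b, p) - eps
                   for b in rivals)
               for p in (lo, hi)):
            admissible.append(a)
    return admissible

print(e_admissible(0.0, 1.0))    # both actions survive the trivial interval
print(e_admissible(0.55, 0.95))  # only one action survives a narrower interval
```

Calling `e_admissible` again after each narrowing of the interval mirrors the anytime character of the procedure: the computation can stop as soon as the returned list has one element.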

A simple strategy for selecting portions of the database to which to apply the inference rules
suggests itself: Begin by applying the 'Trivial derivation' rule,

$$\vdash p(\alpha \mid \delta) \in [0, 1],$$

with $\alpha$ instantiated to each target proposition. Then add all sentences containing as a
component one of the target propositions. If the target intervals are insufficiently narrow to
eliminate all but one of the actions, add the sentences containing the other propositions
contained in the first set of sentences, and so on. This can be refined to the addition of just
one such sentence at each step.

In the example, there is one target proposition, and the trivial derivation rule gives

(8)  p(Rain) ∈ [0, 1].

Sentences (4) and (5) contain the statement 'Rain'. Trivial derivation can be applied to the
antecedents, which in turn can be combined with (8) via 'Forward implication propagation',

$$p(\beta \to \alpha \mid \delta) \in [x, y],\;\; p(\beta \mid \delta) \in [u, v] \;\vdash\; p(\alpha \mid \delta) \in [\max(0,\, x+u-1),\; y],$$

to yield two more intervals for p(Rain):

(9)  p(Rain) ∈ [0, .95],

(10)  p(Rain) ∈ [0, .6].

Sentence (10) is again produced when (8), (9), and (10) are combined using the 'Multiple
derivation' rule:

$$p(\alpha \mid \delta) \in [x, y],\;\; p(\alpha \mid \delta) \in [u, v] \;\vdash\; p(\alpha \mid \delta) \in [\max(x, u),\; \min(y, v)].$$

Use of this rule is necessary to maintain the tightest determinable interval at each step for the
target sentence and is essentially what gives Frisch and Haddawy's system its anytime
character.

Both  actions  remain  admissible  at  this  point. 

There are no more sentences containing 'Rain' as a component. We now look at those
containing 'Barometric pressure < 30', 'Humidity > 80', and 'Temperature > 85'.
'Conjunction introduction',

$$p(\alpha \mid \delta) \in [x, y],\;\; p(\beta \mid \delta) \in [u, v] \;\vdash\; p(\alpha \wedge \beta \mid \delta) \in [\max(0,\, x+u-1),\; \min(y, v)],$$

may be applied to statements (2) and (3), resulting in

(11)  p(B. pressure < 30 & Humidity > 80) ∈ [.9, 1].

Applying forward implication propagation to statements (4) and (11) gives

(12)  p(Rain) ∈ [.55, .95].

Although combining statement (12) with statement (10) via the multiple derivation rule will
further narrow the target interval, there is no need to do so; nor is there any need to consider
statements (1), (6), and (7). 'Do not go' has emerged as uniquely admissible:

$$D(\text{Do not go}) = \{p \in P_{\{\text{Rain},\, \text{No rain}\}} \mid p(\text{Rain}) \ge .5\},$$

$$D(\text{Go}) = \{p \in P_{\{\text{Rain},\, \text{No rain}\}} \mid p(\text{Rain}) \le .5\},$$

$$D(\text{Go}) \cap \{p \in P_{\{\text{Rain},\, \text{No rain}\}} \mid p(\text{Rain}) \in [.55, .95]\} = \emptyset,$$

$$D(\text{Do not go}) \cap \{p \in P_{\{\text{Rain},\, \text{No rain}\}} \mid p(\text{Rain}) \in [.55, .95]\} \neq \emptyset.$$
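
The derivation of intervals (8) through (12) can be replayed mechanically. The sketch below is illustrative rather than Frisch and Haddawy's implementation: the function names are hypothetical, the conditioning sentence is left implicit, and exact rational arithmetic is used only to avoid floating-point noise in the bounds.

```python
from fractions import Fraction as F

def trivial_derivation():
    """|- p(alpha) in [0, 1]."""
    return (F(0), F(1))

def forward_implication(implication, antecedent):
    """From p(beta -> alpha) in [x, y] and p(beta) in [u, v],
    infer p(alpha) in [max(0, x + u - 1), y]."""
    (x, y), (u, _v) = implication, antecedent
    return (max(F(0), x + u - 1), y)

def conjunction_introduction(a, b):
    """From p(alpha) in [x, y] and p(beta) in [u, v],
    infer p(alpha & beta) in [max(0, x + u - 1), min(y, v)]."""
    (x, y), (u, v) = a, b
    return (max(F(0), x + u - 1), min(y, v))

def multiple_derivation(a, b):
    """Intersect two intervals derived for the same sentence."""
    (x, y), (u, v) = a, b
    return (max(x, u), min(y, v))

# Premises (2)-(5) from the text.
s2 = s3 = (F("0.95"), F(1))
s4 = (F("0.65"), F("0.95"))
s5 = (F("0.4"), F("0.6"))

s8 = trivial_derivation()
s9 = forward_implication(s4, trivial_derivation())
s10 = forward_implication(s5, trivial_derivation())
target = multiple_derivation(multiple_derivation(s8, s9), s10)
s11 = conjunction_introduction(s3, s2)
s12 = forward_implication(s4, s11)
```

Because the lower bound of `s12` exceeds 1/2, the loop over rules can stop at this point, which is the anytime behaviour described in the text.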

3.  Nilsson’s  Probabilistic  Logic  and  Decision  Making 

Nilsson's methods may be modified to yield an anytime procedure for decision making. Rather
than construct the linear system corresponding to the full set of sentences, increasingly larger
systems may be constructed by adding sentences to the subset currently in use until a
uniquely admissible action emerges or it is necessary to choose among the currently admissible
actions.

This may be illustrated with the sentences and decision problem above. Suppose sentences (2)
and (4) are chosen for the first iteration. Using Nilsson's "semantic tree" method, five sets of
possible worlds are identified for these sentences plus "Rain": those in which all three are
true; those in which the implication is true, but "Rain" and "B. pressure < 30" are false; etc.

Both actions are E-admissible. "Go" is E-admissible because there exist feasible solutions to
the system of linear inequalities below, where $p_i$ is the probability of set $w_i$ of possible
worlds ("Rain" is true in sets $w_3$ and $w_5$, "Humidity > 80" is true in sets $w_1$, $w_4$,
and $w_5$, etc.):

$$\begin{aligned}
p_1 + p_2 + p_3 + p_4 + p_5 &= 1 \\
p_2 + p_3 + p_4 + p_5 &\ge 0.65 \\
p_2 + p_3 + p_4 + p_5 &\le 0.95 \\
p_1 + p_4 + p_5 &\ge 0.95 \\
(p_3 + p_5) \times 0 + (p_1 + p_2 + p_4) \times 1 &\ge (p_3 + p_5) \times 0.8 + (p_1 + p_2 + p_4) \times 0.2
\end{aligned}$$

"Do not go" is also E-admissible, since the system resulting from reversing the direction of the
final inequality also has feasible solutions.

Now add sentence (3). The resulting 8 sets of possible worlds may be determined by
expanding only the "live" terminal nodes of the semantic tree constructed at the first iteration.
(To eliminate the need for a row interchange, the root of the initial tree should represent the
target sentence. One may proceed in this way until it is no longer necessary to continue,
possible to continue, or worth continuing. If the number of sets of worlds generated becomes
excessive, Snow's compression method [1991] may be attempted.) "Do not go" is now
identified as uniquely E-admissible; there exist feasible solutions to the system below, but not
to the corresponding system for "Go":

$$\begin{aligned}
p_1 + p_2 + p_3 + p_4 + p_5 + p_6 + p_7 + p_8 &= 1 \\
p_1 + p_2 + p_3 + p_4 + p_5 + p_6 + p_7 &\ge 0.65 \\
p_1 + p_2 + p_3 + p_4 + p_5 + p_6 + p_7 &\le 0.95 \\
p_1 + p_2 + p_6 + p_8 &\ge 0.95 \\
p_1 + p_3 + p_5 + p_8 &\ge 0.95 \\
(p_1 + p_2 + p_3 + p_4) \times 0.8 + (p_5 + p_6 + p_7 + p_8) \times 0.2 &\ge (p_1 + p_2 + p_3 + p_4) \times 0 + (p_5 + p_6 + p_7 + p_8) \times 1
\end{aligned}$$
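
The infeasibility of the corresponding system for "Go" can be checked by hand. The short derivation below is a sketch that uses only the equality, the lower bounds in the eight-world system, and non-negativity of the $p_i$; it is not taken from the source, but it reproduces the lower bound in statement (12).

```latex
\begin{aligned}
p_1 + \cdots + p_7 \ge 0.65 \ \text{and}\ \textstyle\sum_i p_i = 1
  &\;\Rightarrow\; p_8 \le 0.35, \\
p_1 + p_2 + p_6 + p_8 \ge 0.95
  &\;\Rightarrow\; p_3 + p_4 + p_5 + p_7 \le 0.05
  \;\Rightarrow\; p_5 + p_7 \le 0.05, \\
p_1 + p_3 + p_5 + p_8 \ge 0.95
  &\;\Rightarrow\; p_2 + p_4 + p_6 + p_7 \le 0.05
  \;\Rightarrow\; p_6 \le 0.05, \\
p(\text{No rain}) = p_5 + p_6 + p_7 + p_8
  &\le 0.05 + 0.05 + 0.35 = 0.45, \\
p(\text{Rain}) = p_1 + p_2 + p_3 + p_4 &\ge 0.55 .
\end{aligned}
```

"Go" maximizes expected utility only if $p(\text{No rain}) \ge 0.8\,p(\text{Rain}) + 0.2\,p(\text{No rain})$, i.e. only if $p(\text{Rain}) \le 0.5$, which contradicts the last line.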


4.  Decisions  with  Multiple  Conditions 

Frisch and Haddawy's system is applicable to decision problems with an arbitrary number $n$ of
mutually exclusive conditions $c_j$. The $n(n-1)/2 + 1$ statements

$$\begin{aligned}
p(c_1 \vee \cdots \vee c_n) &\in [1, 1] \\
p(c_1 \,\&\, c_2) &\in [0, 0] \\
&\;\;\vdots \\
p(c_{n-1} \,\&\, c_n) &\in [0, 0]
\end{aligned}$$

must be included. Intervals must be maintained for each of the conditions $c_j$. The soundness
of Frisch and Haddawy's inference rules guarantees that, at any time, the interval $[l_j, u_j]$
associated with any $c_j$ is a superset of the tightest interval entailed (algebraically) by the full
collection of sentences. Thus, the sharpest intervals available at any time yield a linear
system from which it can be determined whether an action would not be E-admissible relative
to the sharper probability bounds computable at any later time: action $a_i$ is (ultimately)
admissible only if there exist feasible solutions to

$$\begin{aligned}
p(c_1) + \cdots + p(c_n) &= 1 \\
p(c_1) &\ge l_1 \\
p(c_1) &\le u_1 \\
&\;\;\vdots \\
p(c_n) &\ge l_n \\
p(c_n) &\le u_n \\
p(c_1) \times U(a_i, c_1) + \cdots + p(c_n) \times U(a_i, c_n) &\ge p(c_1) \times U(a_1, c_1) + \cdots + p(c_n) \times U(a_1, c_n) \\
&\;\;\vdots \\
p(c_1) \times U(a_i, c_1) + \cdots + p(c_n) \times U(a_i, c_n) &\ge p(c_1) \times U(a_m, c_1) + \cdots + p(c_n) \times U(a_m, c_n),
\end{aligned}$$

where $l_j$ and $u_j$ are the current bounds on $p(c_j)$.

Information may be lost if probability intervals are computed separately for each of the
conditions in a decision problem with more than two conditions. For a given set $K \subseteq P_C$ of
probability distributions, let us refer to the solution set of the linear system obtained by
combining the additivity constraint with the inequalities corresponding to the tightest
probability bounds inferable from the constraints determining K as the envelope of K. There
are naturally arising convex sets K of probability distributions over n > 2 conditions that are
proper subsets of their envelopes. The intersection of the domain of an action with K may be
empty, although its intersection with the envelope of K is not. Thus, actions that are not
E-admissible may not be identified as such, resulting in unnecessary indeterminateness.

Nilsson's semantic tree method can be adapted to take into account the mutual exclusivity and
exhaustiveness of multiple (i.e., more than two) conditions in a decision problem. The first n
levels of the tree will correspond to the n conditions. (This facilitates the anytime adaptation
of Nilsson's methods discussed above.) At level n there will be n live nodes, one for each of
the assignments in which exactly one of the conditions is true. The remaining levels of the tree
are constructed as usual. For example, with conditions $c_1$, $c_2$ and $c_3$, an arbitrary number
$m \ge 2$ of actions $a_i$, and data $p(B \to c_1) \in [0.9, 1]$ and $p(B) \in [0.8, 1]$, there are 6 sets of
possible worlds, corresponding to the matrix

1  1  0  0  0  0    (c1)
0  0  1  1  0  0    (c2)
0  0  0  0  1  1    (c3)
1  1  1  0  1  0    (B → c1)
1  0  0  1  0  1    (B)

Action $a_i$ is E-admissible iff there exist feasible solutions to the system of linear inequalities:

$$\begin{aligned}
p_1 + \cdots + p_6 &= 1 \\
p_1 + p_2 + p_3 + p_5 &\ge 0.9 \\
p_1 + p_4 + p_6 &\ge 0.8 \\
(p_1 + p_2) \times U(a_i, c_1) + (p_3 + p_4) \times U(a_i, c_2) + (p_5 + p_6) \times U(a_i, c_3) &\ge (p_1 + p_2) \times U(a_1, c_1) + (p_3 + p_4) \times U(a_1, c_2) + (p_5 + p_6) \times U(a_1, c_3) \\
&\;\;\vdots \\
(p_1 + p_2) \times U(a_i, c_1) + (p_3 + p_4) \times U(a_i, c_2) + (p_5 + p_6) \times U(a_i, c_3) &\ge (p_1 + p_2) \times U(a_m, c_1) + (p_3 + p_4) \times U(a_m, c_2) + (p_5 + p_6) \times U(a_m, c_3)
\end{aligned}$$
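
The six columns of the matrix can be generated by brute-force enumeration. The sketch below is a stand-in for the semantic tree, not Nilsson's procedure itself: it simply fixes which one condition holds and whether B holds, then evaluates B → c1. The function name is hypothetical, and the column order it produces may differ from the order printed in the text.

```python
from itertools import product

def possible_worlds(n_conditions=3):
    """Consistent truth assignments for (c1, ..., cn, B -> c1, B).

    Exactly one condition is true in each world, which is how mutual
    exclusivity and exhaustiveness are built in.
    """
    columns = []
    for j, b in product(range(n_conditions), (True, False)):
        conditions = [int(k == j) for k in range(n_conditions)]
        implication = int((not b) or j == 0)  # truth value of B -> c1
        columns.append(tuple(conditions + [implication, int(b)]))
    return columns

worlds = possible_worlds()
for row, label in enumerate(["c1", "c2", "c3", "B -> c1", "B"]):
    print(" ".join(str(w[row]) for w in worlds), f"({label})")
```

Each printed row gives the coefficients of one constraint in the linear system: the probability of a sentence is the sum of the probabilities of the worlds whose column has a 1 in that row.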

5.  Maximum  Entropy  and  Probabilistic  Logic 

Nilsson  [1986]  shows  how  to  maximize  entropy  within  the  set  of  probability  distributions  over 
the  possible  worlds  in  order  to  compute  a  point-valued  probability  for  an  entailed  sentence. 
The  maximum  entropy  estimate  of  the  probability  of  the  entailed  sentence  is  the  sum  of  the 
components  of  the  maximum  entropy  distribution  corresponding  to  the  worlds  in  which  the 
sentence  is  true.  Point-valued  probabilities  for  each  of  the  conditions  in  a  decision  problem  are 
computable  also  from  the  distribution  over  the  possible  worlds  maximizing  entropy. 

If an action maximizes expected utility relative to the maximum entropy estimate, it is
guaranteed to be E-admissible relative to any set K of distributions to which the estimate
belongs. But, of course, the converse does not hold. E-admissible actions that, depending on
one's philosophy of decision making, perhaps should be retained for further consideration, are
eliminated. It may be that one of these actions uniquely maximizes expected utility relative to
the (pace, inter alios, DeFinetti) "correct" element of K.

To illustrate, suppose that, after looking closely at the evidence, an expert comes finally to the
conclusion that $p(P) = 0.2$ and $p(P \to Q) = 0.9$. Suppose that his conclusion is final also in
the sense that he expires immediately upon recording it. Unfortunately, we wish to make a
decision the outcome of which depends on the truth of the proposition Q. Suppose that
$A = \{a_1, a_2\}$ and that

$$\begin{aligned}
U(a_1, Q) &= 1.0 \\
U(a_1, \neg Q) &= 0 \\
U(a_2, Q) &= 0.2 \\
U(a_2, \neg Q) &= 0.7
\end{aligned}$$

The range of probabilities for Q compatible with the expert's recorded opinions is

$$0.1 \le p(Q) \le 0.9.$$

The maximum entropy estimate (see below) is 0.5. Action $a_1$ has higher expected utility than
$a_2$ does relative to this estimate. Suppose, however, that our expert was perfectly
"calibrated" and that, had he survived long enough to arrive at an opinion on $p(Q)$ itself, it
would have been $p(Q) = 0.1$. Relative to this distribution, the expected utilities of $a_1$ and $a_2$
are 0.1 and 0.65 (utiles), respectively: $a_2$ has by far the greater expected utility.
Retaining both actions as E-admissible relative to the available information and using,
e.g., maximin to resolve the indeterminacy (or even performing a mixture giving any positive
probability of selection to $a_2$) will result in higher expected utility than choosing action $a_1$.
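
The comparison between choosing on the maximum entropy estimate and resolving the indeterminacy by maximin can be reproduced directly. The sketch below assumes the utilities given above for the expert example; the function names are hypothetical, and the endpoint check in `maximin_choice` relies on expected utility being linear in p(Q).

```python
# U[action][truth value of Q], as in the expert example.
U = {"a1": {True: 1.0, False: 0.0}, "a2": {True: 0.2, False: 0.7}}

def expected_utility(action, p_q):
    """Expected utility of an action when p(Q) = p_q."""
    return p_q * U[action][True] + (1.0 - p_q) * U[action][False]

def maxent_choice(estimate):
    """Action that maximizes expected utility at a point estimate of p(Q)."""
    return max(U, key=lambda a: expected_utility(a, estimate))

def maximin_choice(lo, hi):
    """Action with the best worst-case expected utility over p(Q) in [lo, hi]."""
    return max(U, key=lambda a: min(expected_utility(a, lo),
                                    expected_utility(a, hi)))

print(maxent_choice(0.5))        # choice at the maximum entropy estimate
print(maximin_choice(0.1, 0.9))  # choice when the whole interval is retained
print(expected_utility("a1", 0.1), expected_utility("a2", 0.1))
```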

If the maximum entropy distribution at least tends to be close, on some metric, to the actual
distribution over the worlds, then its projection will tend to be close to the actual probability
distribution over the conditions. (But note that the result of marginalizing the maximum
entropy element of a set K is not always the maximum entropy element of the set of
marginals of elements of K.) The closer the estimate of the probabilities of the conditions is
to the true distribution, the likelier it is that it will belong to one of the domains containing the
true distribution. Thus, the likelier it is that an action maximizing utility relative to the true
distribution will be selected.

How close can one expect the maximum entropy estimate to be to the true distribution over
the possible worlds? If you accept Jaynes' concentration theorem [Jaynes, 1982], i.e., if
probabilities are observed relative frequencies and sequences of observations are equiprobable
a priori and the number of observations is infinite and if you accept difference in entropy as a
distance measure, then the answer is "very". If you want to stick with metric distances and
probabilities are allowed to be subjective, then it might be reasonable to ask how close the
maximum entropy element is to the centroid of the set, which minimizes expected
sum-of-squares error [MacQueen and Marschak, 1975], but is more expensive to calculate
[Piepel, 1983].

When the set of distributions is either a singleton or the full probability simplex, the maximum
entropy element is guaranteed to coincide with the centroid. It always coincides with the
centroid for the modus ponens inference pattern: Let $x = p(P)$ and $y = p(P \to Q)$. There are
four sets of possible worlds, the probabilities of which are the solutions to the system of
equations:

$$\begin{aligned}
p_1 + p_2 + p_3 + p_4 &= 1 \\
p_1 + p_2 &= x \\
p_1 + p_3 + p_4 &= y.
\end{aligned}$$

The solution set is either a single point (when $p(P) = 1$) or a line segment in $[0, 1]^4$ with
vertices

$$(p_1, p_2, p_3, p_4) = (y - (1 - x),\; 1 - y,\; 1 - x,\; 0)$$

$$(p_1, p_2, p_3, p_4) = (y - (1 - x),\; 1 - y,\; 0,\; 1 - x).$$

The centroid of the line segment is the average of the vertices, which coincides with the
maximum entropy distribution calculated by Nilsson.

This will not always be the case. It does not even appear that one can expect the maximum
entropy estimate to be especially close to the centroid (which one cares about if one wishes to
minimize expected squared error). Consider the conjunction pattern of inference: from A and
B, infer A&B. There are four sets of possible worlds: those in which A&B is true, those in
which A&¬B is true, etc. The set of solutions to the system is again either a single point or a
line segment in $[0, 1]^4$. Let $l(A \& B)$ and $u(A \& B)$ denote, respectively, the greatest lower and
least upper bounds on $p(A \& B)$. Ordering components as

$$(p(A \& B),\; p(A \& \neg B),\; p(\neg A \& B),\; p(\neg A \& \neg B)),$$

the solution set has vertices

$$v^l = (l(A \& B),\; p(A) - l(A \& B),\; p(B) - l(A \& B),\; 1 - (p(A) + p(B) - l(A \& B)))$$

$$v^u = (u(A \& B),\; p(A) - u(A \& B),\; p(B) - u(A \& B),\; 1 - (p(A) + p(B) - u(A \& B))).$$

The centroid is the average of the two vertices:

$$ce = (v^l + v^u)/2.$$

The maximum entropy element coincides with the distribution computed under the assumption
of probabilistic independence of A and B:

$$m = (p(A) \times p(B),\; p(A) \times p(\neg B),\; p(\neg A) \times p(B),\; p(\neg A) \times p(\neg B)).$$

The eccentricity of an element of any non-unit solution set K is the ratio between its
(Euclidean) distance from the centroid and the maximum distance of any element of the set
from the centroid:

$$ecc(p, K) = d(p, ce) \,/\, \max_{p' \in K} d(p', ce).$$

The eccentricity will have a minimum value of 0 (when p = ce) and a maximum value of 1
(when p is a vertex).

For conjunction entailment, it is possible for the value of $ecc(m, K)$ to be quite high. For
example, when $p(A) = 0.9$ and $p(B) = 0.1$, $ecc(m, K) = 0.8$. The expected value of $ecc(p, K)$ for
a randomly selected element p of K is 1/2. Letting $\langle p(A), p(B) \rangle$ range with uniform
probability over $(0, 1)^2$, the expected value of $ecc(m, K)$ is 1/3. So, for conjunction entailment
anyway, one cannot expect the maximum entropy approximation to be especially low-risk.
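
The eccentricity figures quoted for conjunction entailment can be checked numerically. The sketch below is illustrative only: the function names are hypothetical, the vertices use the standard bounds max(0, p(A)+p(B)-1) and min(p(A), p(B)) on p(A&B), and the average over the unit square is a crude midpoint-grid approximation rather than an exact integral.

```python
from math import dist

def solution_vertices(pa, pb):
    """End points of the solution segment, ordered as
    (p(A&B), p(A&~B), p(~A&B), p(~A&~B))."""
    lower = max(0.0, pa + pb - 1.0)
    upper = min(pa, pb)

    def vertex(c):
        return (c, pa - c, pb - c, 1.0 - (pa + pb - c))

    return vertex(lower), vertex(upper)

def maxent_eccentricity(pa, pb):
    """ecc(m, K) for 0 < pa, pb < 1 (the segment is degenerate otherwise)."""
    v_lo, v_hi = solution_vertices(pa, pb)
    centroid = tuple((a + b) / 2.0 for a, b in zip(v_lo, v_hi))
    m = (pa * pb, pa * (1 - pb), (1 - pa) * pb, (1 - pa) * (1 - pb))
    return dist(m, centroid) / dist(v_lo, centroid)

def mean_eccentricity(n=100):
    """Midpoint-grid average of ecc(m, K) over the open unit square."""
    points = [(i + 0.5) / n for i in range(n)]
    return sum(maxent_eccentricity(a, b) for a in points for b in points) / n**2

print(maxent_eccentricity(0.9, 0.1))
print(mean_eccentricity())
```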

Kane [1990, 1991] has developed a method of computing the maximum entropy solution that
is faster than that proposed by Nilsson. Deutsch-McLeish [1990] has determined conditions
under which Nilsson's projection approximation (which is not, in general, the centroid of the
solution set) coincides with the maximum entropy solution. These can be tested to determine
whether the (much cheaper) projection approximation method can be substituted for direct
maximization of entropy. But, as argued above, there is no need to compute any type of
point-valued estimate of condition probabilities for a decision problem.

6.  Anytime  Decision  Making  with  Probabilistic  Databases 

A probabilistic database [Cavallo and Pittarelli, 1987; Pittarelli, 1994; Barbara et al., 1993]
generalizes a relational database by replacing the characteristic function of a relation with a
probability distribution (or probability intervals). For example, the tables below represent
estimates of probabilities for various (joint) events on a typical August day in a fictitious
developing country:

Rain  No Phones  p^1
yes   true       0.4
yes   false      0.1
no    true       0.2
no    false      0.3

No Phones  Trains  p^2
true       yes     0.25
true       no      0.35
false      yes     0.25
false      no      0.15

No Phones  Temperature  p^3
true       high         0.45
true       med          0.1
true       low          0.05
false      high         0.25
false      med          0.1
false      low          0.05

Temperature  Humidity  p^4
high         high      0.6
high         low       0.1
med          high      0.15
med          low       0.05
low          high      0
low          low       0.1

Suppose,  again,  that  it  must  be  decided  whether  or  not  to  go  to  the  beach.  It  is  believed  that 
the  only  relevant  conditions  are  whether  or  not  it  will  rain  and  whether  or  not  evening  trains 
will  run.  Utilities  this  time  are: 


                      Don't go   Go
(rain, train)         3/4        1/2
(rain, no train)      7/8        0
(no rain, train)      1/8        1
(no rain, no train)   1/2        5/8

Conditional independence relations that would permit calculation of a unique (maximum
entropy) joint distribution over all of the attributes mentioned in the tables, and from which
(by marginalization) a unique probability distribution over the four joint conditions for the
decision problem could be calculated, are not assumed. Nonetheless, it can be determined from
the database that exactly one of the actions is E-admissible. There are infinitely many
distributions over the Cartesian product of the domains of the attributes in the database whose
marginals coincide with the distributions in the database. Each of these is a solution to a system
of 20 linear equations in 48 unknowns. (The solution set is referred to as the extension of the
database, defined formally below.) The probability of any of the 4 rain/train conditions is the
sum of 12 of the 48 unknowns. Thus, E-admissibility can be determined as in the previous
examples.

However, we have seen that, for various systems of probabilistic logic, it is not necessary to
take into account all of the available sentences (even those that are relevant in the sense of
having an effect on the entailed probabilities of the conditions) in order to solve decision
problems. Similarly, working with an entire database may introduce unnecessary expense.
Anytime algorithms can be devised as well for decision making with probabilistic databases.
Let us introduce a bit of formalism.


The set of attributes over which a probability distribution $p$ (i.e., element of a probabilistic
database) is defined is referred to as its scheme and is denoted $s(p)$. The scheme of a
database

$$P = \{p^1, \ldots, p^n\}$$

is the set of schemes of its elements:

$$s(P) = \{s(p^1), \ldots, s(p^n)\}.$$

The scheme of the database $\{p^1, p^2, p^3, p^4\}$ above is

$$\{\{\text{Rain}, \text{No Phones}\}, \{\text{No Phones}, \text{Trains}\}, \{\text{No Phones}, \text{Temp.}\}, \{\text{Temp.}, \text{Humidity}\}\}.$$

Scheme $S$ is a refinement of scheme $S'$, denoted

$$S \le S',$$

iff for each $V \in S$ there exists a $V' \in S'$ such that $V \subseteq V'$.

Each attribute $v$ has a domain, its set of possible values, denoted $dom(v)$. In the example,

$$dom(\text{Temperature}) = \{\text{high}, \text{med}, \text{low}\}.$$

Where some arbitrary but fixed ordering is assumed over the attributes, the domain of a
scheme is the Cartesian product of the domains of its elements:

$$dom(V) = \times_{v \in V}\, dom(v).$$

So, with No Phones preceding Trains in the ordering,

$$dom(s(p^2)) = \{\langle \text{true}, \text{yes} \rangle, \langle \text{true}, \text{no} \rangle, \langle \text{false}, \text{yes} \rangle, \langle \text{false}, \text{no} \rangle\}.$$

The projection of a distribution $p$ onto a set of attributes $A$ such that $A \subseteq s(p)$, denoted

$$\pi_A(p),$$

is obtained by adding probabilities of tuples that agree on values for attributes in $A$:

$$\pi_A(p)(x) = \sum_{t \in dom(s(p)),\; t[A] = x} p(t),$$

where $t[A]$ denotes the restriction of tuple $t$ to attributes contained in set $A$.


A database may be projected onto any scheme that is a refinement of its own. The result is a
database whose elements are marginals of its own elements. For example, the projection of
the database above onto the scheme {{Trains}, {Temperature}},

$$\pi_{\{\{\text{Trains}\}, \{\text{Temperature}\}\}}(\{p^1, p^2, p^3, p^4\}),$$

is:


Trains  p^  Temperature  p® 

yes  0.5  high  0.7 

no  0.5  med  0.2 

low  0.1 
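
Projection is ordinary marginalization, which the following sketch makes concrete for the Trains and Temperature marginals. The representation is an assumption made for illustration: a distribution is a dictionary from value tuples to exact fractions, accompanied by a tuple naming its scheme in order, and `project` is a hypothetical helper name.

```python
from collections import defaultdict
from fractions import Fraction as F

def project(distribution, scheme, attributes):
    """Marginal of `distribution` (tuple -> probability) on `attributes`.

    `scheme` lists the attribute names in the order used by the tuples.
    """
    positions = [scheme.index(a) for a in attributes]
    marginal = defaultdict(F)  # Fraction() is 0
    for values, probability in distribution.items():
        marginal[tuple(values[i] for i in positions)] += probability
    return dict(marginal)

# Two of the four tables from the example database.
p2_scheme = ("No Phones", "Trains")
p2 = {("true", "yes"): F("0.25"), ("true", "no"): F("0.35"),
      ("false", "yes"): F("0.25"), ("false", "no"): F("0.15")}
p3_scheme = ("No Phones", "Temperature")
p3 = {("true", "high"): F("0.45"), ("true", "med"): F("0.1"),
      ("true", "low"): F("0.05"), ("false", "high"): F("0.25"),
      ("false", "med"): F("0.1"), ("false", "low"): F("0.05")}

trains = project(p2, p2_scheme, ("Trains",))
temperature = project(p3, p3_scheme, ("Temperature",))
```

The same helper can be used to compare the marginals of two tables on a shared attribute, for instance No Phones in `p2` and `p3`.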


An individual distribution may also be projected onto a database scheme. If

$$X \le \{s(p)\},$$

then

$$\pi_X(p) = \{\pi_V(p) \mid V \in X\}.$$

Databases created in this way are guaranteed to exhibit various forms of consistency. For
example, such databases are locally consistent. A database K is locally consistent iff, for any
$p^i$ and $p^j$ in K,

$$\pi_A(p^i) = \pi_A(p^j)$$

for all $A \subseteq s(p^i) \cap s(p^j)$.


The extension of a distribution $p$ to a set of attributes $V \supseteq s(p)$ is the set of all distributions
over $dom(V)$ whose projection onto $s(p)$ equals $p$:

$$E^V(p) = \{p' \in P_V \mid \pi_{s(p)}(p') = p\}.$$

The extension of a database to a set of attributes is the intersection of the extensions of its
elements:

$$E^V(P) = \bigcap_{p \in P} E^V(p).$$

The superscript $V$ may be omitted if

$$V = \bigcup_{p \in P} s(p).$$

A database P is globally consistent iff its extension $E(P)$ is non-empty.


Databases formed by projection are guaranteed to be both locally and globally consistent. So
are databases whose schemes are partitions. Global consistency implies local consistency,
regardless of the structure of the database scheme. However, local consistency implies global
consistency only for databases whose schemes are α-acyclic [Fagin, 1983]. A polynomial-time
test for α-acyclicity of a database scheme X is the following:

1.  W := X.

Apply in any order until neither is applicable:

2.  If v is an element appearing in only one V ∈ W,
    then W := (W − {V}) ∪ {V − {v}}.

3.  If V_i ⊆ V_j for any V_i, V_j ∈ W, i ≠ j,
    then W := W − {V_i}.

X is α-acyclic iff the algorithm terminates with W = {∅}.
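
The reduction test can be sketched as follows. This is a minimal illustration, not code from the cited sources: the function name is hypothetical, schemes are represented as sets of frozensets (so identical elements collapse automatically), and step 2 removes all singly occurring attributes of one element at a time, which is equivalent to repeated application of the rule.

```python
def is_alpha_acyclic(scheme):
    """True iff the reduction of `scheme` terminates with {emptyset}."""
    w = {frozenset(v) for v in scheme}
    changed = True
    while changed:
        changed = False
        # Step 2: drop attributes that occur in exactly one element of W.
        for v in list(w):
            lonely = {x for x in v if sum(x in other for other in w) == 1}
            if lonely:
                w.remove(v)
                w.add(v - lonely)
                changed = True
                break
        # Step 3: drop an element that is contained in a different element.
        for v in list(w):
            if any(v <= other and v != other for other in w):
                w.remove(v)
                changed = True
                break
    return w == {frozenset()}

beach_scheme = [{"Rain", "No Phones"}, {"No Phones", "Trains"},
                {"No Phones", "Temperature"}, {"Temperature", "Humidity"}]
triangle = [{"A", "B"}, {"B", "C"}, {"A", "C"}]

print(is_alpha_acyclic(beach_scheme))
print(is_alpha_acyclic(triangle))
```

For the scheme of the example database the test succeeds, so local consistency of that database is enough to guarantee global consistency.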


The database algebra just sketched can be utilized to define various anytime decision-making
algorithms. The key fact is that if database scheme S is a refinement of S', then the extension
(to any number of attributes) of the projection of a database onto S' is a subset of the
extension of the projection onto S [Pittarelli, 1994]. Formally,

$$X \le Y \text{ implies } E^V(\pi_Y(P)) \subseteq E^V(\pi_X(P)).$$

(This is due to the fact that the linear equations the solutions to which are the elements of
$E^V(\pi_X(P))$ are linear combinations of the equations in the system corresponding to
$E^V(\pi_Y(P))$.)

Thus, if an action is E-admissible relative to the set of probabilities over the conditions that
can be calculated from a database, then it is E-admissible relative to the probabilities
calculated from any projection of the database. Equivalently, if an action can be determined not
to be E-admissible relative to a projection, it can be inferred that it is not E-admissible relative
to the original database.


Since the number of E-admissible actions decreases monotonically as schemes become less
refined, anytime decision methods are possible for problems in which the set of conditions is
the Cartesian product of attribute domains from the database (or can be constructed from the
tuples in such a product). Let $V_c$ denote this set of attributes.

This suggests the following anytime strategy: Starting with $W = \{\{v\} \mid v \in V_c\}$, the most
refined scheme that is a cover of some $V \supseteq V_c$, repeatedly aggregate $W$ until there is a single
admissible action relative to $\pi_{V_c}(E(\pi_W(K)))$, or $\pi_{V_c}(E(\pi_W(K)))$ happens to contain only
one distribution, or $W = s(P)$, whichever comes first. However, if database schemes are replaced
by immediate aggregation, very little progress toward sufficient reduction of the set
$\pi_{V_c}(E(\pi_W(K)))$ is likely to be made at each step. Also, there will not be a unique immediate
aggregate at each step. A reasonable alternative is the sequence of schemes:

$$(\{\{v\} \mid v \in V_c\},\;\; \{V_i \cap V_c \mid V_i \in s(P),\, V_i \cap V_c \neq \emptyset\},\;\; \{V_i \mid V_i \in s(P),\, V_i \cap V_c \neq \emptyset\},\;\; s(P)).$$

Following this strategy, suppose that a single admissible action is identified with
$W = \{\{v\} \mid v \in V_c\}$. Then, for $m$ actions, linear programs with only $|dom(V_c)|$ unknowns and
$m - 1$ inequalities are sufficient.

For the beach decision problem, $V_c$ = {Rain, Trains}. The projection of the database onto
$\{\{v\} \mid v \in V_c\}$ is

Trains  p        Rain  p
yes     0.5      yes   0.5
no      0.5      no    0.5

Both actions are E-admissible relative to the set of joint probabilities compatible with this
database.


For this problem,

$$\{V \cap V_c \mid V \in s(P),\, V \cap V_c \neq \emptyset\} = \{\{v\} \mid v \in V_c\}.$$

The projection onto $\{V \mid V \in s(P),\, V \cap V_c \neq \emptyset\}$ is the set of distributions $\{p^1, p^2\}$, above.
"Don't go" is identified from this set of distributions as uniquely E-admissible. There exist
feasible solutions to the system of inequalities below, but not to the corresponding system for
"Go".

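$$\begin{aligned}
p(\text{Rain}=\text{yes}, \text{No Phones}=\text{true}, \text{Trains}=\text{yes}) + p(\text{yes}, \text{true}, \text{no}) &= 0.4 \\
p(\text{yes}, \text{false}, \text{yes}) + p(\text{yes}, \text{false}, \text{no}) &= 0.1 \\
p(\text{no}, \text{true}, \text{yes}) + p(\text{no}, \text{true}, \text{no}) &= 0.2 \\
p(\text{no}, \text{false}, \text{yes}) + p(\text{no}, \text{false}, \text{no}) &= 0.3 \\
p(\text{yes}, \text{true}, \text{yes}) + p(\text{no}, \text{true}, \text{yes}) &= 0.25 \\
p(\text{yes}, \text{true}, \text{no}) + p(\text{no}, \text{true}, \text{no}) &= 0.35 \\
p(\text{yes}, \text{false}, \text{yes}) + p(\text{no}, \text{false}, \text{yes}) &= 0.25 \\
p(\text{yes}, \text{false}, \text{no}) + p(\text{no}, \text{false}, \text{no}) &= 0.15
\end{aligned}$$

$$\begin{aligned}
&(\tfrac{3}{4} - \tfrac{1}{2}) \times (p(\text{yes}, \text{true}, \text{yes}) + p(\text{yes}, \text{false}, \text{yes})) \\
&+ (\tfrac{7}{8} - 0) \times (p(\text{yes}, \text{true}, \text{no}) + p(\text{yes}, \text{false}, \text{no})) \\
&+ (\tfrac{1}{8} - 1) \times (p(\text{no}, \text{true}, \text{yes}) + p(\text{no}, \text{false}, \text{yes})) \\
&+ (\tfrac{1}{2} - \tfrac{5}{8}) \times (p(\text{no}, \text{true}, \text{no}) + p(\text{no}, \text{false}, \text{no})) \;\ge\; 0.
\end{aligned}$$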
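
The claim that "Don't go" is uniquely E-admissible for this system can be verified without a general linear programming solver. The sketch below rests on an observation that is not in the source: the eight marginal equations leave two free quantities, a = p(yes, true, yes) and b = p(yes, false, yes), and non-negativity confines them to 0.05 ≤ a ≤ 0.25 and 0 ≤ b ≤ 0.1. Expected utility is linear in (a, b), so it suffices to inspect the four corners of that box. All names are hypothetical.

```python
from fractions import Fraction as F
from itertools import product

# Utilities U[action][(rain, trains)] from the table in the text.
U = {
    "Don't go": {("yes", "yes"): F(3, 4), ("yes", "no"): F(7, 8),
                 ("no", "yes"): F(1, 8), ("no", "no"): F(1, 2)},
    "Go":       {("yes", "yes"): F(1, 2), ("yes", "no"): F(0),
                 ("no", "yes"): F(1), ("no", "no"): F(5, 8)},
}

def joint(a, b):
    """Joint over (Rain, No Phones, Trains) determined by the marginal
    equations once a = p(yes,true,yes) and b = p(yes,false,yes) are fixed."""
    return {
        ("yes", "true", "yes"): a,
        ("yes", "true", "no"): F("0.4") - a,
        ("no", "true", "yes"): F("0.25") - a,
        ("no", "true", "no"): a - F("0.05"),
        ("yes", "false", "yes"): b,
        ("yes", "false", "no"): F("0.1") - b,
        ("no", "false", "yes"): F("0.25") - b,
        ("no", "false", "no"): b + F("0.05"),
    }

def expected_utility(action, distribution):
    return sum(pr * U[action][(rain, trains)]
               for (rain, _phones, trains), pr in distribution.items())

corners = list(product((F("0.05"), F("0.25")), (F(0), F("0.1"))))
gaps = [expected_utility("Don't go", joint(a, b))
        - expected_utility("Go", joint(a, b)) for a, b in corners]
print(gaps)  # advantage of "Don't go" at each corner of the feasible box
```

Since the advantage is strictly positive at every corner, it is strictly positive throughout the feasible set, so no compatible distribution makes "Go" optimal.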
The cardinality $|dom(V)|$ can be arbitrarily large, and many (or all) of the variables in $V - V_c$
may be irrelevant to the given problem. If the conjecture below is correct, then there exists a
polynomial-time algorithm for eliminating irrelevant attributes from a connected α-acyclic
database scheme. A database scheme $s(P)$ is connected iff for every $V \in s(P)$ there exists a
$V' \in s(P)$, $V' \neq V$, such that $V \cap V' \neq \emptyset$.


For $s(P)$ a cover of $V \supseteq V_c$, define a bipartite graph B with node set $V \cup s(P)$ and edge set

$$\{\{v, x\} \mid v \in V,\, x \in s(P),\, v \in x\}.$$

Let

$$R = \{v \in V \mid v \in V_c \text{ or } v \text{ is on an acyclic path in } B \text{ between two elements of } V_c\}.$$

Let

$$Z = \{V_i \cap R \mid V_i \in s(P) \text{ and } V_i \cap R \not\subseteq V_j \cap R, \text{ for all } V_j \in s(P),\, i \neq j\}.$$

Conjecture:

If $s(P)$ is connected and α-acyclic then the following polynomial-time algorithm will produce
the reduced structure Z [Maier and Ullman, 1982]:

1.  Z := s(P).

2.  Repeat in any order until neither has any effect on the current value of Z:

    a.  If a variable v ∉ V_c appears in only one element of Z, remove v from that element.

    b.  If Z contains elements V_i and V_j such that V_i ⊆ V_j, then Z := Z − {V_i}.  □

In the unlikely event that the conjecture is proved false, it is still guaranteed to be the case
that

$$\{V_i \mid V_i \in s(P),\, V_i \cap V_c \neq \emptyset\} \le Z \le s(P).$$

Therefore, Z can be inserted between these two schemes in the aggregation sequence
suggested above.


It  is  difficult  to  achieve  consistency  for  real-valued  databases  whose  schemes  are  not  partitions 
and  for  which  probabilities  are  assessed  on  each  element  separately.  Apart  from  the  problem 
of  consistency,  it  is  also  difficult  to  estimate  real-valued  probabilities  with  any  degree  of 
confidence. 


To illustrate, imagine that data are generated via two coin tossing experiments. In the first
experiment, coins C1 and C2 are tossed simultaneously 250 times with resulting frequency
distribution

C1  C2  count
H   H   70
H   T   60
T   H   40
T   T   80

Coin C2 is again tossed in the second experiment, this time with coin C3:

C2  C3  count
H   H   65
H   T   50
T   H   70
T   T   65

If probabilities are estimated as relative frequencies, we obtain the database:

C1  C2  p^1        C2  C3  p^2
H   H   0.28       H   H   0.26
H   T   0.24       H   T   0.20
T   H   0.16       T   H   0.28
T   T   0.32       T   T   0.26

Distributions $p^1$ and $p^2$ are inconsistent:

C2  $\pi_{\{C_2\}}(p^1)$            C2  $\pi_{\{C_2\}}(p^2)$
H   0.44                 $\neq$     H   0.46
T   0.56                            T   0.54
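
The arithmetic behind this example can be checked with a short script. The sketch below uses only the counts given in the two frequency tables; the helper names `relative_frequencies` and `marginal` are illustrative choices, not terminology from the paper.

```python
from fractions import Fraction

# Observed counts from the two experiments (250 tosses each).
COUNTS_C1_C2 = {("H", "H"): 70, ("H", "T"): 60, ("T", "H"): 40, ("T", "T"): 80}
COUNTS_C2_C3 = {("H", "H"): 65, ("H", "T"): 50, ("T", "H"): 70, ("T", "T"): 65}


def relative_frequencies(counts):
    """Estimate a joint distribution as count / total, kept exact with Fractions."""
    total = sum(counts.values())
    return {outcome: Fraction(n, total) for outcome, n in counts.items()}


def marginal(dist, position):
    """Project a joint distribution onto the coin at the given tuple position."""
    result = {}
    for outcome, p in dist.items():
        key = outcome[position]
        result[key] = result.get(key, 0) + p
    return result


p1 = relative_frequencies(COUNTS_C1_C2)
p2 = relative_frequencies(COUNTS_C2_C3)
c2_from_p1 = marginal(p1, 1)   # C2 is the second coin of the first experiment
c2_from_p2 = marginal(p2, 0)   # C2 is the first coin of the second experiment
print({k: float(v) for k, v in c2_from_p1.items()})
print({k: float(v) for k, v in c2_from_p2.items()})
print("consistent:", c2_from_p1 == c2_from_p2)
```

The two marginals for C2 differ (0.44 versus 0.46 for heads), which is exactly the inconsistency shown in the table above.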

Just  as  interval-valued  probabilities  can  be  attached  to  statements  in  Nilsson’s  and  Frisch  and 
Haddawy’s  systems  of  probabilistic  logic,  we  can  assign  probability  intervals  to  the  tuples  in  a 
database.  The  operations  of  projection  and  extension  are  easily  modified  to  handle  probability 
intervals  (and  in  such  a  way  that  they  reduce  to  the  definitions  given  above  for  real-valued 
databases  in  the  extreme  case  in  which  the  intervals  themselves  reduce  to  real  numbers). 


For the coin experiments, we can report confidence intervals instead of real-valued
probabilities. 95% intervals for the data (based on 250 observations) are:

C1  C2  $i^1$             C2  C3  $i^2$
H   H   [0.22, 0.34]      H   H   [0.20, 0.32]
H   T   [0.18, 0.30]      H   T   [0.14, 0.26]
T   H   [0.10, 0.22]      T   H   [0.22, 0.34]
T   T   [0.26, 0.38]      T   T   [0.20, 0.32]

The interval distributions $i^1$ and $i^2$ are consistent. An interval-valued database
$I = \{i^1, \ldots, i^m\}$ is locally consistent iff

$\pi_{s(i^j) \cap s(i^k)}(i^j)(t) \cap \pi_{s(i^j) \cap s(i^k)}(i^k)(t) \neq \emptyset$

for all $t \in dom(s(i^j) \cap s(i^k))$, where $j, k \in \{1, \ldots, m\}$ and $s(i^j) \cap s(i^k) \neq \emptyset$.

The projection of an interval distribution onto any subset of its scheme is itself an interval
distribution. The endpoints of a component of the projection are the minimum and maximum
values of the components of the real-valued projections of the real-valued distributions
enveloped by $i$. The endpoints can be determined by linear programming. For example, the
projection $\pi_{\{C_2\}}(i^1)$ is determined from the system $L$ of inequalities

$$
\begin{aligned}
p(HH) + p(TH) &= p(H) \\
p(HT) + p(TT) &= p(T) \\
p(HH) &\ge 0.22 \\
p(HH) &\le 0.34 \\
&\ \ \vdots \\
p(TT) &\ge 0.26 \\
p(TT) &\le 0.38 \\
p(H) + p(T) &= 1
\end{aligned}
$$

(which implies $p(HH) + \cdots + p(TT) = 1$) as

$\pi_{\{C_2\}}(i^1)(H) = [\min\{p(H) \mid p(H) \text{ satisfies } L\},\ \max\{p(H) \mid p(H) \text{ satisfies } L\}]$

and

$\pi_{\{C_2\}}(i^1)(T) = [\min\{p(T) \mid p(T) \text{ satisfies } L\},\ \max\{p(T) \mid p(T) \text{ satisfies } L\}]$.

The projections onto $\{C_2\}$ of $i^1$ and $i^2$ are:

C2  $\pi_{\{C_2\}}(i^1)$        C2  $\pi_{\{C_2\}}(i^2)$
H   [0.32, 0.56]          H   [0.34, 0.58]
T   [0.44, 0.68]          T   [0.42, 0.66]


Notice that, although they can in this case, in general the endpoints of the marginals cannot
be obtained simply by adding the endpoints of the corresponding joint intervals. Had $i^1(HT)$
been reported instead as [0.18, 0.31], the value of $\max \pi_{\{C_2\}}(i^1)(T)$ would remain 0.68.
$\max \pi_{\{C_2\}}(i^1)(T) = 0.69$ $(= 0.31 + 0.38)$ would imply the possibility that

$p(HT) + p(TT) = 0.69$,

which implies

$p(HH) + p(TH) = 0.31$,

which contradicts the constraints $p(HH) \ge 0.22$ and $p(TH) \ge 0.10$.
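
For this particular kind of system (one interval per tuple plus the normalization constraint) the linear program has a simple closed-form optimum, so the projections can be checked without an LP solver. The sketch below relies on that special structure and is not a general substitute for linear programming; the names `project` and `locally_consistent` are assumptions made for illustration.

```python
from fractions import Fraction as Fr

# Interval distributions from the text: i^1 over (C1, C2), i^2 over (C2, C3).
I1 = {("H", "H"): (Fr("0.22"), Fr("0.34")), ("H", "T"): (Fr("0.18"), Fr("0.30")),
      ("T", "H"): (Fr("0.10"), Fr("0.22")), ("T", "T"): (Fr("0.26"), Fr("0.38"))}
I2 = {("H", "H"): (Fr("0.20"), Fr("0.32")), ("H", "T"): (Fr("0.14"), Fr("0.26")),
      ("T", "H"): (Fr("0.22"), Fr("0.34")), ("T", "T"): (Fr("0.20"), Fr("0.32"))}


def project(intervals, position):
    """Project an interval distribution onto the variable at `position`.

    The marginal for a value is the sum of the tuples containing it. Because
    all probabilities must sum to 1, its maximum is the smaller of (sum of
    upper endpoints inside) and (1 - sum of lower endpoints outside); the
    minimum is obtained symmetrically.
    """
    result = {}
    for value in sorted({t[position] for t in intervals}):
        inside = [iv for t, iv in intervals.items() if t[position] == value]
        outside = [iv for t, iv in intervals.items() if t[position] != value]
        low = max(sum(lo for lo, _ in inside), 1 - sum(hi for _, hi in outside))
        high = min(sum(hi for _, hi in inside), 1 - sum(lo for lo, _ in outside))
        if low > high:
            raise ValueError("the interval constraints are infeasible")
        result[value] = (low, high)
    return result


def locally_consistent(proj_a, proj_b):
    """True if the two projected intervals overlap for every value."""
    return all(max(proj_a[v][0], proj_b[v][0]) <= min(proj_a[v][1], proj_b[v][1])
               for v in proj_a)


c2_from_i1 = project(I1, 1)
c2_from_i2 = project(I2, 0)
print(locally_consistent(c2_from_i1, c2_from_i2))

# Widening i^1(HT) to [0.18, 0.31] leaves the upper endpoint for T at 0.68.
widened = dict(I1)
widened[("H", "T")] = (Fr("0.18"), Fr("0.31"))
print(float(project(widened, 1)["T"][1]))
```

The last call illustrates the point made in the text: adding the upper endpoints would give 0.69, but the lower bounds on p(HH) and p(TH) cap the marginal at 0.68.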

The real-valued extension of an interval distribution or database is the set of real-valued
distributions (over the specified domain) satisfying the linear inequalities implied by the given
intervals. An interval-valued extension can also be defined.

Let us say that $i$ is more informative than $i'$, denoted

$i \le i'$,

iff

$i(t) \subseteq i'(t)$

for all tuples $t$. Where $E^r(I)$ denotes the real-valued extension of database $I$, and where
$E^r(I)^*$ denotes the narrowest interval distribution enveloping $E^r(I)$, the interval-valued
extension of $I$ is the set of interval distributions

$\{i \mid p \le i \le E^r(I)^*, \text{ for some } p \in E^r(I)\}$,

where a real-valued distribution is regarded as a special type of interval distribution. This set
is partially ordered under $\le$; its structure is quite similar to that of solution sets for finite
fuzzy relation equations [Higashi et al., 1984].

For decision making, the same strategies used with real-valued databases are applicable in the
interval-valued case. A series of decision problems is generated, using larger and larger
portions of the database for each successive problem. The only difference is that E-admissibility is
tested against a system of linear inequalities.
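
As a small illustration of testing E-admissibility against linear inequalities, consider the simplest possible case: a single unknown probability p = P(H) confined to an interval, and a finite set of actions whose utilities depend only on whether H or T occurs. An action is E-admissible if it maximizes expected utility for at least one p in the interval. The sketch below handles only this one-parameter case (a general database would call for a linear programming solver); the action names and utility values in the usage note are hypothetical.

```python
from fractions import Fraction as Fr


def e_admissible(utilities, lo, hi):
    """Return the actions that are optimal for some p = P(H) in [lo, hi].

    `utilities` maps each action to (utility if H, utility if T). For action
    a and rival b, EU_a(p) - EU_b(p) = slope * p + intercept is linear in p,
    so the set of p where a beats every rival is an interval.
    """
    admissible = []
    for a, (a_h, a_t) in utilities.items():
        low, high, feasible = lo, hi, True
        for b, (b_h, b_t) in utilities.items():
            if b == a:
                continue
            intercept = a_t - b_t
            slope = (a_h - b_h) - (a_t - b_t)
            if slope > 0:        # a is at least as good as b only for p >= -intercept/slope
                low = max(low, Fr(-intercept) / Fr(slope))
            elif slope < 0:      # a is at least as good as b only for p <= -intercept/slope
                high = min(high, Fr(-intercept) / Fr(slope))
            elif intercept < 0:  # b is strictly better than a for every p
                feasible = False
        if feasible and low <= high:
            admissible.append(a)
    return admissible
```

With the hypothetical utilities `{"bet_heads": (1, 0), "bet_tails": (0, 1), "abstain": (Fr(2, 5), Fr(2, 5))}` and p in [0.32, 0.56], both bets are E-admissible and abstaining is not; narrowing the interval to [0.32, 0.45] leaves only `bet_tails`, showing how the admissible set shrinks as the constraints tighten.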

Note that even relative to the (projection of the) extension of an entire database there may be
more than one E-admissible action. If this is so, then, if the database is interval-valued, Loui's
methods [1986] may be applied to narrow the intervals. If the intervals are confidence intervals, this
amounts to merely reducing the confidence level in stages and checking at each step for the
existence of a unique E-admissible action. Alternatives applicable to point-valued probabilistic
databases are the variable and structural refinements discussed by Poh and Horvitz [1993] and
"coarsenings" of the database scheme. The latter, which includes structural refinement as a
special case (i.e., may, but needn't, introduce new variables), requires the assessment of joint
probabilities over supersets of the sets of variables contained in the original database scheme.
If the old database is a projection of the new database, then the new set of E-admissible
actions is a subset of the old.

7.  Conclusion 

Anytime decision methods may be devised for use with probabilistic databases, Frisch and
Haddawy's anytime deduction system, and Nilsson's probabilistic logic. Common to each of
these methods is the generation of a system of linear inequalities, the unknowns of which are
probabilities of the conditions for a decision problem. Levi's E-admissibility criterion may be
applied to the solution set of the system of inequalities. The size of the system of inequalities
increases, and the set of admissible actions shrinks, as more of the knowledge base or
database is taken into account.

Specific  measures  of  the  quality  of  a  decision  are  not  explored.  It  seems  that,  for  a  fixed  set  of 
actions  under  consideration,  reasonable  measures  will  be  such  that  the  quality  of  the  decision 
based  on  a  set  E  of  admissible  actions  will  be  higher  (ignoring  the  cost  of  computation)  than 
that of any decision based on a superset of E. For each of the methods discussed, actions are
eliminated from consideration as computation proceeds. Thus, the quality of a decision (made
by choosing an action from the currently admissible set using some criterion other than
E-admissibility) increases with time.

Determining  which  sentences,  or  projections  of  a  database,  will  eliminate  the  greatest  number 
of  actions  at  the  least  cost,  and  whether  it  is  worth  the  effort  to  consider  additional  sentences 
or projections at all, is a difficult problem which remains for future research.
Abstract

We have studied the ultrafast optical switching dynamics of type-II GaAs/AlAs
superlattices, InGaAs asymmetric-coupled-quantum-well (ACQW) materials and
GaAs/AlGaAs vertical-cavity surface-emitting laser (VCSEL) structures, with an eye
towards the development of high-speed optoelectronic modulators having unique
operational advantages over current designs. To determine the absorption and
refractive-index dynamics of these materials and to optimize device designs utilizing them, a flexible
pump-probe characterization facility with femtosecond resolution in the visible, near IR, and
1.3-1.55 µm ranges is required. In this report we summarize our efforts to design and
construct a suitable femtosecond pump-probe system, and outline preliminary attempts to
exploit ultrafast switching mechanisms in type-II, ACQW and VCSEL materials for optical
processing applications.
Background -- Materials and Devices for Optoelectronic Switching/Modulation

Asymmetric  Fabry-Perot  modulators  (AFPMs)  fabricated  from  multiple  quantum  well 
(MQW) materials have attracted considerable attention in recent years as potential
elements  in  optical  information  processing,  interconnection,  and 
multiplexing/demultiplexing  systems.  Both  electro-absorption^  '*  and  all-optical^"®  devices 
have  been  demonstrated.  Unfortunately,  despite  impressive  performance  characteristics, 
including  21  GHz  modulation  rates®  and  1000:1  contrast  ratios'*,  AFPMs  have  yet  to  move 
from  the  university  lab  into  the  commercial  marketplace. 

Three  key  drawbacks  of  previously-studied  material  structures  can  be  identified  which  have 
prevented  AFPMs  from  finding  their  way  into  practical  high-speed  processing  systems. 
First,  the  optical  switching  effect  used  in  many  of  these  devices  is  critically  dependent  on 
precise operating wavelengths. Second, the vast majority of the AFPM-based switching
components demonstrated have been incompatible with the fiber-relevant 1.33 and 1.55 µm
spectral  regions.  Finally,  though  the  bulk  of  the  AFPM  concepts  discussed  in  the  literature 
display  extremely  rapid  switch-on  characteristics,  their  overall  processing  rates  tend  to  be 
strongly  limited  by  their  recombination-time-dependent  switch-off  times,  which  are  typically 
on  the  order  of  1  ns  or  more. 

Wavelength  Sensitivity 

Most of the electro-absorption modulators studied to date operate using the
quantum-confined Stark effect (QCSE), while the all-optical AFPMs tend to rely on
intensity-dependent bleaching of excitonic absorption. The QCSE and excitonic bleaching both
utilize carrier-density-dependent shifting of excitonic resonances to bring about absorption
and refractive-index changes. The relatively narrow linewidth of excitonic transitions leads
to high contrast but makes the resulting devices extremely sensitive to operating
wavelength. Thus a material which could undergo significant absorption and
refractive-index changes at a wider range of operating wavelengths would contribute to device
robustness and manufacturability. Tolerance to diode laser linewidth, process variation,
temperature tuning, etc. could also be enhanced as a result.

Fiber  Compatibility 

Research concerning AFPMs has focused strongly on the GaAs/AlGaAs and strained
GaAs/InGaAs material systems, producing operating wavelengths in the 860 and 980 nm
regions. Though these wavelengths are compatible with inexpensive off-the-shelf diode
laser sources, they fail to effectively exploit the well-established transmission "windows"
in optical fiber. As a result, research groups have begun searching for high-performance
modulators at the communications-relevant wavelengths of 1.3 and 1.55 µm, where
significant commercial applications can be readily anticipated. Particularly promising are
devices based on the InAlGaAs system, which can be lattice-matched to InP. As with
earlier GaAs/AlGaAs work, both QCSE electro-absorption and all-optical AFPMs have
been successfully fabricated^'^. Perhaps the most encouraging result to date in this regard
is the work of Krol et al.®, who reported 1000:1 contrast in an InAlGaAs/InAlAs MQW AFPM
operating at 1.3 µm. The operating speed of this all-optical device was found to be limited
to just under 1 GHz by the 725 ps recovery time of the excitonic optical nonlinearities.

Switching  Recovery  Time 

Most  optoelectronic  and  all-optical  switching  devices  based  on  bandedge  nonlinearities  are 
limited in their switch-off speed by the recombination time τ_r of the electron-hole pairs
created during the initial switch-on process. Typical values of τ_r for GaAs/AlGaAs-based
material  structures  are  between  1  and  10  ns.  Though  sub-nanosecond  recovery  times  can 
be  achieved  through,  e.g.,  enhanced-surface-recombination  geometries  or  proton 
implantation,  the  resulting  device  operating  frequencies  are  still  limited  to  approximately 
2-5  GHz.  Since  electronic  systems  routinely  operate  in  the  1-10  GHz  range,  potential 
optoelectronic  replacements  will  have  to  deliver  significantly  better  performance  if  they  are 
to  be  commercially  viable.  To  raise  potential  operating  frequencies  of  optical  switching 
components  to  the  range  of  50  GHz,  picosecond  or  subpicosecond  recovery  mechanisms 
are  strongly  desirable. 


Enhancing  AFPM  Performance 

My research this summer dealt with three different material/device structures with the
potential to overcome the limitations of existing AFPM schemes: GaAs/AlAs type-II
superlattices, InGaAs/InAlAs asymmetric coupled quantum-wells (ACQWs) and
vertical-cavity surface-emitting laser (VCSEL) structures.

Type-II Superlattices

A nonlinear electroabsorption effect is available in GaAs/AlAs heterostructures which is
potentially more efficient and robust than the quantum-confined Stark effect at visible
wavelengths and significantly less wavelength-sensitive. This mechanism utilizes so-called
type-II superlattices, in which narrow GaAs wells and wider AlAs barriers lead to a band
alignment which is indirect in both real and reciprocal space. Meynadier and co-workers
have shown^° that under an applied electric field, such structures can be induced to switch
their band alignment from type-II to type-I, i.e., from quasi-indirect to direct, albeit at low
temperature. Since the nonlinear absorption characteristics of type-II and type-I materials
differ considerably^ \, it has been suggested that a nominally type-II material could exhibit
strong absorption changes under an applied electric field. Because the switching
mechanism involved depends on field-induced energy-level shifts (i.e., carrier redistribution
among wells and barriers) rather than solely on carrier creation, switch-off speeds are a
function of tunneling times instead of recombination rates. As a result, recovery times can
be of the order of a few picoseconds. Type-II modulators could thus potentially operate
at frequencies of the order of 50-100 GHz with significantly better tolerance to source
wavelength variation than QCSE-based devices.

We have designed and fabricated AFPM devices utilizing type-II superlattice spacer layers
with the goal of demonstrating significant switching contrast utilizing the type-II -> type-I
transition. We have also studied type-II p-i-n structures with the goal of optimizing design
of type-II materials for use at room temperature^^. Further measurements using
photoluminescence  and  ultrafast  pump-probe  techniques  in  the  600-700  nm  region  are 
required  before  these  goals  can  be  achieved.  The  resulting  devices  should  demonstrate 
robust  wavelength  response,  femtosecond-on/picosecond-off  switching,  and  room 
temperature  operation.  Though  the  650-700  nm  operating  wavelength  of  these  devices 
is  incompatible  with  silica  fiber,  it  provides  an  excellent  match  to  transmission  windows  in 
rugged,  low-cost  plastic  fiber. 


InGaAs/InAlAs ACQWs

As mentioned above, the all-optical InAlGaAs/InAlAs MQW AFPM concept pioneered by
Krol et al. operates at 1.3 µm but is limited by recombination to a 1 GHz modulation rate.
Recent results from the same group imply that the recovery time could be lowered to about
20 ps through the use of asymmetric coupled quantum-well (ACQW) structures in the
InGaAs/InAlAs spacer layer^^. As of yet, this fast recovery mechanism is not well
understood,  and  its  device  implications  are  still  uncertain.  Our  research  aims  to  pinpoint 
the  relevant  physical  processes  underlying  the  fast  recovery,  determine  the  corresponding 
absorption  and  refractive-index  dynamics  that  accompany  it,  and  design  AFPM  structures 
to  utilize  it  for  all-optical  switching/routing  functions  in  time-division-multiplexed  network 
applications.  The  ideal  tools  for  this  effort  are  again  photoluminescence  and  femtosecond 
pump-probe spectroscopy. In this case, however, tunable femtosecond pulses in the 1.3
µm region are required.

VCSELs  operated  as  transmissive  switching  elements  with  gain 

A  third  scheme  for  providing  practical  high-speed  optoelectronic  switching  elements 
involves  adapting  vertical-cavity  surface-emitting  lasers  for  use  as  transmission-mode 
amplifying  switches.  When  operated  slightly  below  threshold,  such  devices  should 
demonstrate  rapid  nonlinear  switch-on  and  switch-off  characteristics  due  to  stimulated 
recombination within the gain region of the absorption spectrum’^. Their amplification
properties  will  allow  for  low-power  addressing  and  cascadability.  We  have  designed  such 
devices  and  performed  preliminary  theoretical  modeling  of  their  performance. 
Femtosecond  pump-probe  analysis  of  their  optical  response  will  allow  experimental 
feedback  to  the  modeling  process  and  enable  the  optimization  of  cavity  parameters, 
fabrication  techniques,  and  device  performance  characteristics.  Such  measurements  will 
once  more  require  a  tunable  source  of  femtosecond  pulses,  this  time  in  the  near-IR  portion 
of  the  spectrum  between  750  and  900  nm. 
Femtosecond Pump-Probe Spectroscopy

The design of asymmetric Fabry-Perot modulators requires understanding of both the
absorption and refractive-index dynamics of the spacer material as a function of excited
carrier density. The refractive index determination is crucial to the precise
resonance-matching necessary to achieve very low reflectivities and thus high contrast ratios. While
differential transmission studies (whether with degenerate or non-degenerate pumping)
can give ultrafast absorption response at one or more isolated frequencies, they give no
information on the time evolution of the index. Induced changes in the index spectrum
Δn(ω,t) can be calculated using the Kramers-Kronig transformation applied to the dynamic
absorption behavior Δα(ω,t), but the absorption changes must be known at a wide range
of closely-spaced frequencies for the calculated Δn values to be reliable. Thus the ideal
experiment would extract the femtosecond-time-scale evolution of the entire
near-bandedge absorption spectrum of the spacer material with high resolution in a series of
"single shot" measurements, allowing accurate determination of the accompanying
refractive-index spectrum using the Kramers-Kronig method. The α and Δn values could
then be inserted into the standard transfer-matrix formalism for evaluating and optimizing
AFPM reflectance/switching behavior.
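
To make the Kramers-Kronig step concrete, the relation in question is commonly written Δn(ω) = (c/π) P∫ Δα(ω') / (ω'² − ω²) dω', where P denotes the principal value. The sketch below evaluates it numerically on a uniformly spaced frequency grid. It is a minimal illustration, not the analysis code used in this work: the function name, the crude handling of the principal value (simply skipping the singular grid point), and the test spectrum in the usage note are all assumptions.

```python
import math

C = 299_792_458.0  # speed of light in vacuum, m/s


def kramers_kronig_delta_n(omega, delta_alpha):
    """Estimate the index change at each grid frequency.

    omega       -- uniformly spaced angular frequencies (rad/s)
    delta_alpha -- absorption change at each frequency (1/m)

    The principal value is approximated by omitting the grid point where
    omega' equals omega; on a uniform grid the neighbouring contributions
    on either side of the singularity largely cancel.
    """
    d_omega = omega[1] - omega[0]
    delta_n = []
    for i, w in enumerate(omega):
        total = 0.0
        for j, w_prime in enumerate(omega):
            if j == i:
                continue
            total += delta_alpha[j] / (w_prime * w_prime - w * w)
        delta_n.append(C / math.pi * total * d_omega)
    return delta_n
```

For a hypothetical bell-shaped absorption increase centred inside the grid, the computed Δn is positive well below the feature and negative well above it, the familiar dispersive line shape. The accuracy near the feature depends directly on how densely Δα is sampled, which is why closely spaced probe frequencies are needed.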

The  preferred  method  for  performing  simultaneous  multiple-wavelength  absorption 
spectroscopy  is  through  the  technique  of  femtosecond  continuum  generation’®.  In  this 
approach,  a  femtosecond  pulse  is  amplified  many  times  until  its  energy  is  sufficient  to 
generate  a  femtosecond  broadband  white-light  continuum  in  a  material  such  as  ethylene 
glycol  or  quartz.  Though  the  earliest  continuum-generation  set-ups  relied  on  CPM  lasers 
and  multiple  dye-jet  amplifier  stages,  these  have  quickly  given  way  to  all-solid-state 
systems based on mode-locked Ti:Sapphire oscillators and Nd:YAG/YLF-pumped
Ti:Sapphire regenerative amplifiers’®. Microjoule fs pulses are readily achieved by both
laboratory and commercial Ti:S regen systems, enabling the generation of white-light
pulses containing tens of nanojoules of energy. By taking a 10 nm "slice" out of the
broadband signal using an interference filter, we can obtain a wavelength-selectable fs
pump  pulse.  The  remainder  of  the  continuum  pulse  provides  the  broadband  probe  signal 
we  need  to  determine  the  absorption  response  to  the  pump  radiation.  Pump  and  probe 
are  both  focused  on  the  sample  under  test,  and  the  transmitted  signals  are  input  to  an 
optical  multichannel  analyzer,  which  records  the  transmission  spectrum  as  a  function  of 
delay between pump and probe, typically with 100 fs resolution. This technique has been
successfully applied to a wide variety of organic and inorganic materials in the wavelength
region from 500-900 nm, including CdSe microcrystallites in glass, and both type-I and
type-II MQWs and superlattices.

These tools are only just now becoming applicable to the 1-2 µm region as well, through
a technique known as optical parametric amplification. In an optical parametric amplifier
(OPA), the broadband radiation from a white light continuum pulse (generated by a Ti:S
regenerative amplifier) is mixed with a narrowband UV signal (obtained from frequency
doubling a portion of the continuum) inside a nonlinear crystal such as BBO. By combining
the co-linear, coherent input beams, sum and difference-frequency signals can be
generated. The lower frequency pulse is called the idler, while the sum frequency pulse
is referred to as the signal. In a typical OPA, the idler signal can be tuned through the 1-2
µm region by simply adjusting the orientation of the BBO crystal. The idler pulse can then
act as the pump, while the lower-frequency tail of the white-light continuum generated by
the input regenerative amplifier acts as the broadband probe in the 1.3 µm spectral regime.
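
The tuning range follows from energy conservation in the mixing process: the difference-frequency output satisfies 1/λ_idler = 1/λ_UV − 1/λ_continuum. The short sketch below works through the numbers. The 390 nm UV wavelength is an assumption made here for illustration (the second harmonic of a 780 nm pulse); the function names are likewise illustrative.

```python
def difference_frequency_wavelength(uv_nm, continuum_nm):
    """Wavelength (nm) of the difference-frequency (idler) output."""
    if continuum_nm <= uv_nm:
        raise ValueError("the continuum component must lie at longer wavelength than the UV beam")
    return 1.0 / (1.0 / uv_nm - 1.0 / continuum_nm)


def continuum_wavelength_for_idler(uv_nm, idler_nm):
    """Continuum wavelength (nm) that mixes with the UV beam to give the requested idler."""
    return 1.0 / (1.0 / uv_nm - 1.0 / idler_nm)


UV_NM = 390.0  # assumed: second harmonic of 780 nm
for idler_nm in (1000.0, 1300.0, 1550.0, 2000.0):
    seed_nm = continuum_wavelength_for_idler(UV_NM, idler_nm)
    print(f"idler {idler_nm:6.0f} nm  <-  continuum component near {seed_nm:5.1f} nm")
```

Under this assumption, an idler at 1.3 µm corresponds to a continuum component near 557 nm, and the whole 1-2 µm idler range maps onto visible continuum wavelengths between roughly 484 and 639 nm.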

System  to  be  Constructed 

Commercially available systems generating tunable femtosecond pulses in the visible, IR
and 1-2 µm regime range in cost from about $100,000 to $280,000, depending on whether
or  not  the  purchaser  already  owns  a  pump  laser.  A  vastly  more  cost-effective  option  is  to 
"home-build"  a  regen/OPA  system  of  our  own.  This  is  the  goal  of  our  proposed  follow-on 
project  under  the  SREP  program.  The  planned  system  will  consist  of  the  following 
components: 

(1) Coherent 310 10 watt CW argon-ion laser (already part of our lab).

(2) Mode-locked Ti:Sapphire oscillator, pumped by the Coherent 310, producing
100 fs pulses at 780 nm (design and some component fabrication performed
this last summer; alignment in progress).

(3)  50  watt  Q-switched  mode-locked  frequency-doubled  Nd:YLF  laser  operating 
at  30  kHz  (under  construction). 

(4) Ti:Sapphire regenerative amplifier, pumped by the Nd:YLF, including
stretcher/compressor  optics,  a  Pockels  Cell/thin-film  polarizer  set-up  with 
which  to  out-couple  the  amplified  femtosecond  pulses,  and  a  quartz  cell  for 
continuum  generation  (components  to  be  purchased). 

(5)  Optical  parametric  amplifier,  with  one  LBO  crystal  for  frequency-doubling  a 
portion  of  the  continuum  from  the  regen,  and  a  second  for  generating  the 
idler  and  signal  pulses,  tunable  between  1-2  microns  (components  to  be 
purchased). 


Work  performed  this  summer 

In  accordance  with  the  areas  of  research  emphasis  discussed  above,  I  performed  the 
following  tasks  during  my  tenure  as  a  research  associate  this  last  summer: 

■Designed type-II p-i-n samples for temperature-dependent PL and
electroabsorption  measurements. 
■Directed growth of the samples by Leavitt et al. of the Army Research Lab.

■Carried  on  preliminary  conversations  with  Prof.  Nan-Marie  Jokerst  of  Georgia  Tech 
to  arrange  for  epitaxial  lift-off  of  transmission-mode  samples  by  her  group. 

■Reviewed earlier measurements for preparation of tech report summarizing
moderate-temperature type-II -> type-I transition results.

■Studied ACQWs in InAlGaAs material system; became familiar with Mark Krol's
work  on  ultrafast  recovery  in  such  materials  at  low  temperature;  carried  on 
extensive  discussions  with  Mark  concerning  potential  device  design  issues  for 
AFPMs  fabricated  from  such  materials. 

■Performed extensive literature review covering Ti:Sapphire-based regenerative
amplifier  systems,  OPA  technology  and  related  OMA/cryogenic  equipment  for 
femtosecond  pump-probe  spectroscopy. 

■Carried  out  vendor  search  for  components  for  home-built  regen/OPA  system. 

■Studied Ti:Sapphire Kerr-lens-mode-locked oscillator design; performed numerical
simulations  allowing  optimization  of  cavity  configuration  for  self-starting  operation. 

■Designed  oscillator  and  custom  components  (crystal  mount,  etc.),  ordered 
components  and  built;  testing  and  alignment  now  underway. 

■Conducted  detailed  literature  search  concerning  recent  VCSEL  designs,  including 
work  of  Raj  et  al.’'*  concerning  VCSELs  for  modulation  and  switching. 


Future  Work 

As  is  clear  from  the  above  summary,  my  efforts  this  summer  have  basically  laid  the 
groundwork  for  an  aggressive  experimental  research  program  investigating  three  different 
semiconductor  materials  suitable  for  ultrafast  switching,  all  centered  around  a  tunable 
femtosecond  pump-probe  system.  My  immediate  short-term  goal  is  to  build  such  a  system 
and  get  it  producing  reliable  data.  To  this  end,  I  have  submitted  an  SREP  proposal  to  fund 
the  work  of  a  research  associate  who  can  assist  me  with  exactly  these  tasks. 
SYNTHESIS AND PROPERTIES
Abstract

A study was initiated to investigate both the synthetic chemistry and properties of
β-diketonate-modified heterobimetallic alkoxides of the alkaline earths and titanium. A
number of single metal diketonates have been utilized as source compounds in the
MOCVD processing of electronic oxides (e.g., BaTiO3, YBa2Cu3Ox, Pb(Zr,Ti)O3, etc.). It
was proposed to extend this work by conducting an initial screening study to evaluate whether
such chelating and branching ligand systems may be used to develop new "single-source"
compounds for bimetallic oxides (e.g., BaTiO3). The synthesis and properties of a
2,2,6,6-tetramethyl-3,5-heptanedionate (thd) modified barium-titanium isopropoxide were
specifically studied. The results of this initial study were inconclusive, but do provide a
technological justification for further study.
SYNTHESIS AND PROPERTIES
OF

β-DIKETONATE-MODIFIED HETEROBIMETALLIC ALKOXIDES
(Single-Source MOCVD Precursors)


Daniel  F.  Ryder,  Jr. 


Introduction 


Metalorganic chemical vapor deposition (MOCVD) of multicomponent oxides has
recently been the subject of extensive research study, particularly as related to the thin
film processing of the cuprate superconductors (e.g., YBa2Cu3Ox, Bi2Sr2Ca2Cu3Oy) and
the ferroelectric perovskites (e.g., BaTiO3, (Ba,Sr)TiO3). Current commercial practice
involves the controlled vaporization and transport of individual metal precursor
compounds to achieve stoichiometric compositional control of the growing film. As an
attractive processing alternative to the complicated operational strategies associated with
conventional multi-source MOCVD processing, the use of single-source complexed-metal
compounds which inherently possess the desired stoichiometric component ratio of the
targeted oxide has been promoted. Chour et al.' recently reported on the vapor
deposition of LiTaO3 from a single-source, volatile double metal alkoxide (LiTa(OBu^t)6).
Likewise, Zhang et al.^ reported on the CVD growth of MgAl2O4 thin films from a
flash-evaporated MgAl2(OPr^i)8 precursor. A review of the definitive text Metal Alkoxides^
shows that there are a number of double metal alkoxides which possess the three
properties required for a MOCVD single-source compound:

1)  A  stoichiometric  composition  consistent  with  the  targeted  product 
oxide. 

2)  Good thermal stability. Solid/liquid-phase thermal decomposition
reactions to the individual metal alkoxides and/or other polymeric
oxo-alkoxide intermediates would lead to nonstoichiometric deposition.

3)  Sufficient  volatility  at  normal  processing  conditions. 

It is notable that no bimetallic alkoxide which contains an alkaline-earth (i.e., Ba,
Sr, Ca):Ti pair possesses adequate thermal stability for CVD applications. Furthermore,
owing to the propensity of the alkaline earths to attain high coordination numbers, all of
the referenced alkaline-earth-based double metal alkoxides possess metallic stoichiometric
ratios inconsistent with commercially significant oxide compositions. Given the current
interest in thin film ferroelectric materials based on BaTiO3, it would be of interest to
investigate methods whereby the chemical composition, thermal stability, and volatility of
Ba:Ti alkoxide-based systems may be modified/enhanced. In this study, the synthetic
chemistry and properties of β-diketonate-modified Ba:Ti alkoxides were studied. As the
basis for this choice, a brief summary of current work relating to the development of
Group II source compounds is presented. In addition, background information relating to
the synthesis of homo- and hetero-metal alkoxides and the reaction kinetics of metal
alkoxides with β-diketones is briefly surveyed.

Precursor Development of Group II Source Compounds for CVD Applications

In order to ensure reasonable epitaxial growth rates at nominal substrate
temperatures, source compounds should transport in the vapor phase at processing
temperatures less than 200 °C and pressures greater than 5 torr. This constraint presents
a significant problem for the alkaline earth source compounds, as their
propensity to attain high coordination numbers, usually by forming multinuclear
aggregates, severely limits their volatility. As such, only those alkaline earth compounds
with multidentate, sterically encumbered anions have proven useful for MOCVD
applications.

Recent synthetic chemical research has focused on both the fluorinated and
fluorine-free β-diketonates, which have the general structural form shown below.


As illustrated, a monomeric alkaline earth β-diketonate would be coordinatively
unsaturated with a coordination number (CN) of 4, whereas a CN of 6 to 10 is the
thermodynamically stabilized structural form. As such, intermolecular association takes
place, resulting in the formation of molecular clusters, which adversely affects the volatility.
Volatility has been reported to increase with increased encapsulation of the metal ion
through the utilization of bulky side-group ligands. Additionally, the use of fluorinated
alkyl groups results in a decrease of intermolecular interactions, which consequently
increases volatility. For the case of fluorinated precursors, the resulting enhancement in
volatility is at the expense of a more complicated decomposition reaction pathway to the
oxide through the hydrolysis of a thermally stable fluoride salt. As such, more recent
studies have concentrated on the non-fluorinated source compounds such as those based
on 2,2,6,6-tetramethyl-3,5-heptanedionate (thd).

Ba(thd)2 forms a tetrameric cluster in the solid state which is maintained in the
transformation to the gas phase. While Ba(thd)2 is air stable, significant decomposition
through hydrolysis and carbonate formation precludes synthesis through the standard "wet"
method. Pure Ba(thd)2 may be synthesized by direct reaction of the metal or metal
hydride with 2 equivalents of the free β-diketone (Hthd). Based on nominal growth rate
requirements, use of pure Ba(thd)2 as a barium precursor requires sublimation
temperatures slightly in excess of 200 °C. Prolonged heating at these temperatures induces
thermal decomposition processes which result in significant deterioration of the vapor
transport rate. Methods to induce dissociation of the tetrameric Ba(thd)2 clusters to
monomeric species would serve to enhance their volatility, and various methods to induce
coordinative saturation through adduct addition have been reported. For example,
evaporation of Ba(thd)2 in the presence of oxygen (e.g., THF, polyethers) or nitrogen (e.g.,
NH3, N(CH3)3) donor ligands has been reported to enhance the thermal stability and the
volatility of the compound such that increased mass transport rates may be maintained at
lower evaporation temperatures.

Synthesis  and  Properties  of  Homo-  and  Hetero-  Metal  Alkoxides 

The reader is referred to the text by Bradley et al. for general information regarding
the synthesis and properties of metal alkoxides, as the background information included
herein is specific to the alkoxides of the alkaline earths and heterobimetallic alkoxides of
the alkaline earths and titanium.

The alkaline earth alkoxides may be synthesized via the direct reaction of the metal
and/or metal hydride with the alcohol. Simple [M(OR)2]n (R = Me, Et, n-Pr, i-Pr) alkoxides
of group 2 metals are generally polymeric, insoluble in organic solvents, and nonvolatile.
Lowering the degree of association has been shown to increase the solubility and
volatility, and the role of steric factors in the extent of oligomerization has been actively
studied. Recent strategies reported in the literature to induce dissociation include the
utilization of sterically demanding ligands (e.g., 2,6-(t-Bu)2C6H3O-, Et3CO-, (CMe3)2CHO-,
etc.) and chelating ligands (e.g., Me2NCH2CH2O-, ROCH2CH2O-, CH3(OCH2CH2)nO-,
etc.). Regarding the 2-methoxyethoxide chelating ligand, it is important to note that
there is an additional tendency for this ligand to act as a bridging ligand, which may lead to the
formation of large oligomers. This property has been the basis for the use of
2-methoxyethoxide as an assembling ligand for the formation of heterometallic alkoxides.

As compared to associated homometallic alkoxides, the heterobimetallic alkoxides
exhibit, in general, an enhanced stability, probably resulting from the greater stability of
the M(μ-OR)2M' bridges between dissimilar metals. In addition, a number of
heterobimetallic alkoxides containing an alkaline earth metal constituent are soluble,
volatile, and monomeric, which contrasts sharply with the insoluble, polymeric, and
nonvolatile nature of the associated homometallic alkoxide. Bimetallic alkoxides containing
an alkaline earth constituent have been synthesized by the Lewis acid-base interaction of
the simple alkoxide of these metals with the alkoxides of less electropositive elements.

D.A. Payne et al. recently reported on the synthesis of barium titanium methoxyethoxide
[BaTi(OCH2CH2OCH3)6] via this method. In previous work, the structure of a similar
lead titanium alkoxide was reported to be consistent with that of a linear polymeric
network with an empirical chemical formula of PbTi(OCH2CH2OCH3)6. This work
highlights the fact that chelating ligands, such as 2-methoxyethoxide, not only act as
bridging ligands conducive to the formation of a heterobimetallic alkoxide, but also
induce the formation of large oligomers. The resulting polymeric structures are, in
general, nonvolatile, and as such bimetallic 2-methoxyethoxides are not suitable source
compounds for CVD applications.

Chemical Reactions of Metal Alkoxides with β-Diketones

The enolic form of a β-diketone contains a reactive hydroxyl group, and this
molecule reacts very readily with metal alkoxides, which may be represented by the
following general reaction:

$$\mathrm{M(OR)}_n + x\,\mathrm{R'COCH_2COR''} \rightarrow \mathrm{M(OR)}_{n-x}(\mathrm{R'COCHCOR''})_x + x\,\mathrm{ROH}$$

The reaction chemistry of titanium(IV) alkoxides with β-diketones has been well studied
and the reader is referred to the literature for details.

Experimental  Procedure 

The  following  chemicals  were  used  without  further  purification:  titanium 
isopropoxide  (Alfa/Johnson  Matthey,  MA),  barium  (Alfa/Johnson  Matthey,  MA),  and 
2,2,6,6-tetramethyl-3,5-heptanedione  (Strem  Chemical,  MA).  Isopropanol  (Alfa/Johnson 
Matthey,  MA)  and  thiophene-free  benzene  (Alfa/Johnson  Matthey,  MA)  solvents  were 
dried  and  purified  by  distillation  over  calcium  hydride.  All  synthesis  work  was  conducted 
in  a  controlled  Ar-atmosphere  glovebox. 

Barium isopropoxide was prepared by reacting barium metal with excess
isopropanol in benzene at room temperature. The diketone-modified alkoxide was
prepared by adding 2,2,6,6-tetramethyl-3,5-heptanedione in a 1:1 molar ratio to barium
and stirring for 12 hours at room temperature.

Titanium diisopropoxide di-2,2,6,6-tetramethyl-3,5-heptanedionate was prepared
by reacting titanium(IV) isopropoxide with 2,2,6,6-tetramethyl-3,5-heptanedione in a 1:2
molar ratio in benzene at room temperature for 12 hours.
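
The molar ratios quoted above translate into reagent masses by simple arithmetic. The sketch below works this out for a hypothetical 10 mmol batch; the batch size and the helper name `reagent_mass` are illustrative assumptions, not values from this study. The molar masses are computed from standard atomic weights.

```python
# Molar masses in g/mol, from standard atomic weights.
MOLAR_MASS = {
    "Ba": 137.33,         # barium metal
    "Ti(OiPr)4": 284.22,  # titanium(IV) isopropoxide, Ti[OCH(CH3)2]4
    "Hthd": 184.28,       # 2,2,6,6-tetramethyl-3,5-heptanedione, C11H20O2
}


def reagent_mass(moles_metal, reagent, equivalents):
    """Mass in grams of `reagent` for `equivalents` per mole of metal."""
    return moles_metal * equivalents * MOLAR_MASS[reagent]


batch_mol = 10.0e-3  # hypothetical batch size: 10 mmol of each metal

ba_g = reagent_mass(batch_mol, "Ba", 1)
hthd_for_ba_g = reagent_mass(batch_mol, "Hthd", 1)   # 1:1 Hthd to Ba
ti_g = reagent_mass(batch_mol, "Ti(OiPr)4", 1)
hthd_for_ti_g = reagent_mass(batch_mol, "Hthd", 2)   # 2 Hthd per Ti

for label, grams in [
    ("Ba", ba_g),
    ("Hthd for Ba", hthd_for_ba_g),
    ("Ti(OiPr)4", ti_g),
    ("Hthd for Ti", hthd_for_ti_g),
]:
    print(f"{label:12s} {grams:.3f} g")
```

Scaling `batch_mol` scales every mass linearly, so the same table serves for any batch size.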

The Ba:Ti product complexes were prepared by reacting stoichiometric quantities of
the barium precursor solution with the titanium precursor solution under
atmospheric reflux (T ≈ 80 °C) for approximately 1 hour. Solvents were removed from
product mixtures by vacuum distillation techniques. The resulting precipitates were
analyzed by thermogravimetric techniques.

Sublimation trials were conducted in a commercial vacuum sublimator (Aldrich
Chemical Co., Milwaukee, WI). Chemical analysis of both sublimate and retentate
samples was conducted by energy-dispersive X-ray spectroscopy (EDS).

Results  and  Discussion 


The reaction of barium isopropoxide and titanium isopropoxide with
2,2,6,6-tetramethyl-3,5-heptanedione exhibited a slight exotherm at the mild conditions employed
in this study. All reaction products were soluble in benzene. Removal of the solvent
benzene by room-temperature vacuum distillation resulted in the dramatic precipitation
of a solid phase which was assumed to be the desired barium-titanium precursor
compound.


The oxidative decomposition kinetics of the barium-titanium precursor were
analyzed by thermogravimetric techniques and compared to commercial Ba(THMD)2
and Ba(O-i-Pr)2. In general, the decomposition profiles (i.e., [wt. vs. T] and [d(wt.)/dT vs. T])
of the prepared Ba-Ti precursor were similar to those of the commercial Ba(THMD)2, and served to
at least empirically confirm the successful complexation of diketone to the metal
alkoxides.
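
The [d(wt.)/dT vs. T] profile mentioned above can be obtained from a recorded weight curve by finite differences. The following is a minimal sketch, assuming the thermogravimetric data are available as paired temperature and weight-percent lists; the data values are hypothetical placeholders, not measurements from this study.

```python
def dw_dT(temps_c, weights_pct):
    """Central-difference estimate of d(wt)/dT at the interior points."""
    if len(temps_c) != len(weights_pct) or len(temps_c) < 3:
        raise ValueError("need two equal-length series with at least 3 points")
    return [
        (weights_pct[i + 1] - weights_pct[i - 1]) / (temps_c[i + 1] - temps_c[i - 1])
        for i in range(1, len(temps_c) - 1)
    ]


def peak_loss_temperature(temps_c, weights_pct):
    """Temperature at which the weight-loss rate is largest (most negative slope)."""
    rates = dw_dT(temps_c, weights_pct)
    i = min(range(len(rates)), key=rates.__getitem__)
    return temps_c[i + 1]  # rates[i] belongs to interior point i + 1


# Hypothetical weight-loss curve (temperature in deg C, weight in percent).
temps = [100, 150, 200, 250, 300, 350, 400]
weights = [100.0, 99.0, 95.0, 80.0, 50.0, 42.0, 40.0]

rates = dw_dT(temps, weights)
peak_T = peak_loss_temperature(temps, weights)
print(peak_T)
```

Comparing the peak-loss temperature and the shape of the derivative curve between samples is one simple way to quantify how "similar" two decomposition profiles are.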

Initial attempts to purify the precipitated product compound by vacuum
sublimation resulted in deposition of two distinct solid phases: a white, needle-like
crystalline phase and a fine, yellow powder. EDS analysis of a composite sample of the
sublimate showed a molar Ba:Ti ratio of approximately 1:3 (note: target ratio, 1:1). By
carefully controlling the sublimation rate (through T control), sublimate samples of
essentially the pure needle-like crystal phase were isolated. EDS analysis of these samples
indicated a molar Ba:Ti ratio of approximately 1:6. Attempts to re-purify this sublimate
through a second vacuum sublimation process resulted in the deposition of a Ba-free
sublimate. Given that these initial experiments were conducted at relatively high
temperature (200-220 °C), it is surmised that thermal decomposition processes were a
significant complicating factor in the overall process.

Conclusion 

The results of this initial study were preliminary and incomplete, and as such,
further work must be conducted in order to adequately assess the viability of the proposed
approach. While samples from initial vacuum sublimations were found to contain both
barium and titanium, re-sublimation of this purified material resulted in a sublimate which
contained only titanium. Furthermore, we did not attempt to fully analyze the structural
features of the synthesized products during this initial exploratory study, and as such, do
not make any claim relative to the molecular structure of product compounds. It is
notable, though, that the oxidative decomposition kinetics of the product samples were
similar to commercial Ba(THMD)2 and dissimilar to commercial Ba(O-i-Pr)2.

In retrospect, and based on the subsequent detailed analysis of the current technical
literature, the use of a starting alkoxide formula based on the utilization of a sterically
demanding ligand or a chelating ligand is preferred in order to enhance the formation of
the initial heterobimetallic alkoxide. To this end, it is suggested that the chelating and
bridging ligand 2-methoxyethoxide be utilized during the complexation of the barium and
titanium alkoxides in future experimental studies.

PHOTOREFRACTIVE DEVELOPMENT AND APPLICATIONS OF

InP AND BSO

Gregory  J.  Salamo 
Professor  of  Physics 
University  of  Arkansas 

Abstract 

An investigation into the photorefractive properties of InP was initiated. Experimental
apparatus for carrying out an investigation was built at Hanscom Air Force Base using a c.w.
YAG laser for steady-state measurements, while complementary apparatus was built at the
University of Arkansas using a picosecond tunable Ti-sapphire laser for transient studies. Initial
measurements on both energy exchange and diffraction efficiency were made, indicating a high
potential for InP as a photorefractor in the near infrared. Results also indicate that the
photorefractive effect will be an effective diagnostic tool for characterization of InP. Because of
this potential, the collaboration between Arkansas and Hanscom will continue beyond the summer
program period.




I.  INTRODUCTION 

Over the last few years we have carried out experimental and theoretical investigations on
the understanding and applications of the photorefractive effect in the ferroelectric oxides, such as
barium titanate (BaTiO3) and strontium barium niobate (SBN). Our research has led to an
improved understanding of these materials and we have demonstrated several novel applications.
For example, we have uncovered the effect of temperature, applied d.c. and a.c. electric fields, and
dopant concentration and type on the magnitude of the photorefractive index change and
the speed with which it is produced. We have demonstrated the confinement or trapping of 30
micron diameter laser beams through 1 centimeter samples and a quasi-phase matching technique to
produce real-time tunable second harmonic generation at any wavelength in a single crystal without
temperature or angle tuning. Our research has produced a demonstration of a type of optical limiter
which has the unique feature of limiting laser light while maintaining normal visibility. We have
demonstrated a photorefractive spatial light modulator, image subtraction, addition, division,
correlation, storage, color storage, contrast enhancement, and phase conjugation, all with spatial
resolution exceeding a hundred lines per millimeter. Our most recent study has produced a
real-time nondestructive material testing scheme using a holographic technique packaged in a small
portable device.

II.  DISCUSSION  OF  THE  PROBLEM 

While each of these applications has been exciting to produce in the laboratory, all is not
well with the field of photorefraction. The ferroelectric oxides are not striation-free, and they have low
carrier mobilities and high dielectric constants. Moreover, it is difficult to control the acceptor and
donor concentrations independently, and their ferroelectric nature produces unwanted effects with
applied d.c. or a.c. fields. For these reasons the sillenites, such as bismuth silicon oxide (BSO),
and the III-V semiconductors, such as InP, are attractive materials for many of the applications we
have investigated. BSO and InP, for example, have high mobilities and low dielectric constants,
show no complications with applied d.c. and a.c. electric fields, and have the potential for
independently controlled acceptor and donor concentrations. This is particularly true for InP where,
due to the importance of defects, the electronic and optoelectronic industries have developed
techniques for controlling the acceptor and donor concentrations. Finally, both BSO and InP are
produced with high optical quality. Rome Laboratory, for example, produces near-perfect, if not
perfect, BSO and InP. These materials, therefore, are an effective starting point for the
development of controlled, high-quality photorefractives, which in turn will lead to improved
application potential.

This summer we began a collaboration with Rome Laboratory to architect these materials
using measurements of their photorefractive properties and their performance in applications as a
guide. The architecture of these materials involves experimenting with type and concentration of
impurities. For example, for InP (see figure on next page), the elements Ge, Si, Sn, S, and Se might
be typical species which would lead to shallow donor and acceptor levels, while Ce, Cu, Cr, Fe,
Ca, Zn, and Mo might be typical species leading to deep donor and acceptor levels. An impurity
concentration  of  lO^^  per  cm^  can  give  a  resistivity  as  high  as  10^  ohm-cm  and  a  dielectric 
relaxation  below  1  ms.  Deep  traps  reduce  the  free  carrier  lifetime  of  photoexcited  carriers  by 
providing  a  high  probability  of  retrapping.  Meanwhile,  the  density  of  deep  traps  gives  rise  to 
optical  absorption  at  wavelengths  above  the  band  edge.  Careful  manipulation,  therefore,  of  the 
dopant  concentration  can  tailor  the  material  response  time,  optical  absorption,  photo-carrier 
mobility and sensitive wavelength. For example, one interesting question we can answer about
our samples is the relative importance of electron versus hole carriers. Because the mobility of
electrons is much higher than that of holes, the two processes can be separated using femtosecond
optical pulses from a mode-locked Ti-sapphire laser for time-resolved measurements of the
"writing" of a photorefractive grating. Another interesting point is that since InP substrates are used
in electronic and optoelectronic devices, the photorefractive effect can act as a non-contact probe for
mapping out the optical and electrical behavior of the substrate. With high spatial and temporal
resolution, the resistivity, absorption, sign of photocarriers, drift length, and diffusion length could be
mapped. Likewise, BSO is of intense interest. It is, for example, faster and more sensitive than
lithium niobate (LiNbO3) and consequently would perform extremely well in the infrared spectral
region for image processing applications.

III. PHOTOREFRACTIVE CONCEPT

In  the  photorefractive  effect  the  index  of  refraction  is  changed  by  a  spatially  modulated 
light  intensity  pattern.  Optically  generated  charge  carriers  migrate  due  to  diffusion,  drift,  and  the 
photovoltaic  effect  and  produce  a  charge  separation.  The  charge  separation  then  results  in  a  space 
charge  field  which  changes  the  index  of  refraction  via  the  electro-optic  effect.  In  our  model,  the 
photorefractive material is doped with donor and acceptor traps. When photons of sufficient
energy enter the crystal, electrons are excited from the donors into the conduction band, where they
are transported until they are re-trapped. The process is modeled using the following transport equations:


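$$\frac{\partial N_D^+}{\partial t} = s_e I\,(N_D - N_D^+) - \gamma_e n_e N_D^+ - s_h I\,N_D^+ + \gamma_h n_h\,(N_D - N_D^+) \tag{1a}$$

$$\mathbf{j}_e = e\,n_e\,\mu_e\,\mathbf{E} + \mu_e k_B T\,\nabla n_e \tag{1b}$$

$$\frac{\partial n_e}{\partial t} = \frac{1}{e}\,\nabla\cdot\mathbf{j}_e + s_e I\,(N_D - N_D^+) - \gamma_e n_e N_D^+ \tag{1c}$$

$$\nabla\cdot\mathbf{E} = -\frac{e}{\varepsilon}\,(n_e + N_A - N_D^+ - n_h) \tag{1d}$$

$$\mathbf{j}_h = e\,n_h\,\mu_h\,\mathbf{E} - \mu_h k_B T\,\nabla n_h \tag{1e}$$

In these expressions $N_D$ is the total number of donor species and $N_D^+$ the number of ionized
donors available for electron recombination and hole photoexcitation, $\mathbf{j}_e$ and $\mathbf{j}_h$ are the electron and
hole current densities and $\mathbf{E}$ the total internal field, $n_h$ and $n_e$ are the densities of free holes and
electrons and $s_h$ and $s_e$ their excitation cross sections, $\gamma_e$ and $\gamma_h$ are the recombination rates, and
$\mu_e$ and $\mu_h$ the carrier mobilities. The density of compensative acceptors is $N_A$ and is equal to the
density of ionized donors in the dark.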

These  equations  describe  the  formation  of  the  space  charge  field  which  is  then  responsible 
for the change in index of refraction via the electro-optic effect described by

$$\Delta\left(\frac{1}{n^2}\right)_{ij} = \sum_k r_{ijk}\,E_k$$

where $r_{ijk}$ is the electro-optic coefficient.

IV.  TWO  BEAM  COUPLING  OR  METHODOLOGY 

An important characteristic of two-beam coupling in a photorefractive material is the energy
exchange from one laser beam to the other. The steady-state values of the energy transfer signal
(ETS) and the diffraction signal (DS) due to the grating inside the crystal can provide interesting
information about a photorefractive material. The origin of energy exchange in two-beam coupling
is the phase shift between the grating and the intensity fringe pattern.

The phase shift between the grating and the fringe pattern can be determined by measuring
the energy exchange signal and the diffraction signal. The phase shift measurement can be a
technique for determining photorefractive constants of a crystal. One important application of this
technique is to measure the crystal trap number density by applying a DC field to a crystal and
determining the resulting phase shift between the grating and the fringe pattern as a function of the
magnitude of the DC field.

A.  Basic  Theory  For  Phase  Shift  Measurement 

In a photorefractive material, the two light waves, which are expressed as

$$E_1(x,z) = E_{10}\exp[i(k_x x + k_z z)] \tag{2a}$$

$$E_2(x,z) = E_{20}\exp[i(-k_x x + k_z z)] \tag{2b}$$

interact at an angle $2\theta$ in the volume phase grating, where $k_x$ and $k_z$ are components of the wave
vector. If $\phi$ is the phase shift between the grating and the interference pattern, the output waves
will become

$$E_1(d) = E_{10}\cos(\kappa d) - i\,E_{20}\,e^{-i\phi}\sin(\kappa d) \tag{3a}$$

$$E_2(d) = E_{20}\,e^{-i\phi}\cos(\kappa d) - i\,E_{10}\sin(\kappa d) \tag{3b}$$

where $\kappa$ is the coupling constant and $d$ is the thickness of the crystal.

Using these two equations, the optical intensities after passing through a crystal are given by

$$I_1(d) = E_1(d)E_1^*(d) = I_T\cos^2(\kappa d) + I_D\sin^2(\kappa d) - (I_T I_D)^{1/2}\sin(2\kappa d)\sin\phi \tag{4a}$$

$$I_2(d) = E_2(d)E_2^*(d) = I_D\cos^2(\kappa d) + I_T\sin^2(\kappa d) + (I_T I_D)^{1/2}\sin(2\kappa d)\sin\phi \tag{4b}$$

where $I_T = E_{10}^2$ and $I_D = E_{20}^2$ are the incident intensities of the two beams. In these expressions the first
term corresponds to the transmitted component, the second term is the diffracted component due to
the grating, and the third term is the energy-exchange component between the two beams. These
terms are depicted in the following figures. As a result, the diffraction efficiency $\eta$ will be given
as

$$\eta = \sin^2(\kappa d) \tag{5}$$
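
As a numerical check on the step from the output fields of Eq. (3) to the intensities of Eq. (4), the sketch below evaluates both for one set of inputs. It assumes the phase factor in both output fields is $e^{-i\phi}$, which is the reading under which the closed forms agree and the total intensity is conserved; the function names and input values are illustrative only.

```python
import cmath
import math


def output_fields(E10, E20, kappa_d, phi):
    """Output fields in the form of Eqs. (3a) and (3b), assuming an e^{-i phi} factor."""
    c, s = math.cos(kappa_d), math.sin(kappa_d)
    shift = cmath.exp(-1j * phi)
    E1 = E10 * c - 1j * E20 * shift * s
    E2 = E20 * shift * c - 1j * E10 * s
    return E1, E2


def output_intensities(I_T, I_D, kappa_d, phi):
    """Closed-form intensities in the form of Eqs. (4a) and (4b)."""
    c2 = math.cos(kappa_d) ** 2
    s2 = math.sin(kappa_d) ** 2
    exchange = math.sqrt(I_T * I_D) * math.sin(2 * kappa_d) * math.sin(phi)
    return I_T * c2 + I_D * s2 - exchange, I_D * c2 + I_T * s2 + exchange


# Illustrative inputs (arbitrary units).
E10, E20, kappa_d, phi = 1.0, 0.5, 0.3, math.pi / 2

E1, E2 = output_fields(E10, E20, kappa_d, phi)
I1, I2 = output_intensities(E10 ** 2, E20 ** 2, kappa_d, phi)

print(abs(E1) ** 2, I1)          # the two values should agree
print(abs(E2) ** 2, I2)          # the two values should agree
print(I1 + I2, E10 ** 2 + E20 ** 2)  # total intensity is conserved
```

The energy-exchange term enters the two beams with opposite signs, so whatever one beam loses the other gains; it vanishes when $\phi = 0$ and is largest when $\phi = \pm\pi/2$.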

If the incident intensities of both beams are equal, we can get the energy-exchange efficiency
$e$ in terms of the diffraction efficiency $\eta$ as

$$e = 2[\eta(1-\eta)]^{1/2}\sin\phi \tag{6}$$

Then

$$\sin\phi = \frac{e}{2[\eta(1-\eta)]^{1/2}} \tag{7}$$

or

$$\sin\phi = \frac{I_e}{2[I_d(I_0 - I_d)]^{1/2}} \tag{8}$$

where $I_e$ is the energy exchange signal, which is the intensity change of the detected beam after the two
beams are turned on; $I_d$ is the diffraction signal, which is the intensity of the detected beam measured
immediately after the incident detected beam is blocked; and $I_0$ is the intensity of the detected beam
without coupling.

[Figure: Two Wave Coupling. The input fields $E_1$ and $E_2$ of Eqs. (2a) and (2b) meet in a grating with permittivity $\varepsilon = \varepsilon_0 + \Delta\varepsilon\cos(2k_x x + \phi)$ and leave as the output fields of Eqs. (3a) and (3b); the coupling coefficient $\kappa$ is proportional to $\Delta\varepsilon$.]
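
The phase shift follows from the three measured intensities through Eq. (8), read here as $\sin\phi = I_e / \{2[I_d(I_0 - I_d)]^{1/2}\}$. The sketch below is a minimal illustration of that arithmetic; the function names and the sample intensities are hypothetical, and because only $\sin\phi$ is measured, the arcsine returns a value between -90° and +90°.

```python
import math


def sin_phase_shift(I_e, I_d, I_0):
    """sin(phi) from the energy-exchange, diffraction, and uncoupled signals."""
    if not 0.0 < I_d < I_0:
        raise ValueError("diffraction signal must lie between 0 and I_0")
    value = I_e / (2.0 * math.sqrt(I_d * (I_0 - I_d)))
    if abs(value) > 1.0:
        raise ValueError("signals are inconsistent: |sin(phi)| would exceed 1")
    return value


def phase_shift_degrees(I_e, I_d, I_0):
    """Phase shift in degrees, restricted to the principal arcsine branch."""
    return math.degrees(math.asin(sin_phase_shift(I_e, I_d, I_0)))


# Hypothetical signals in arbitrary detector units.
I_0 = 2.0   # detected beam without coupling
I_d = 0.2   # diffraction signal (10 percent diffraction efficiency)
I_e = 0.6   # energy-exchange signal

phi_deg = phase_shift_degrees(I_e, I_d, I_0)
print(round(phi_deg, 3))
```

The range checks matter in practice: noise in a small diffraction signal can push the computed $|\sin\phi|$ above 1, which is better reported as an error than silently clipped.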

B. Energy-Transfer Measurement And Determination Of The Phase Shift Between The Grating And The Fringe Pattern

The experimental setup for two-beam coupling is shown in the following figure. The
YAG or Ti-sapphire laser beam is split into two beams, $I_T$ and $I_D$, that intersect inside a thin sample
(held at constant temperature) such that the grating vector is parallel to the c-axis and the applied
field (see crystal holder on next page). The beam splitter divides the beam 50:50 so that $I_T$ has
the same intensity as $I_D$. The angle $2\theta$ between the two beams is about 10°. The beam $I_T(d)$ is
detected with a photodiode. For all our observed intensity values the photodiode response was linear. To
avoid the observation of multi-exponential growth and decay times, the detected beam is expanded
with a beam expander so that only the uniform portion of the beam is detected by the photodiode.
The output of the diode is fed into the 602A Digitizing Signal Analyzer or a computer data
acquisition system. After both beams are turned on, a growth curve is recorded and the energy
exchange signal $I_e$ can be measured. When the grating reaches a steady state, we block the
detected beam. The transmitted beam immediately goes to zero, and the diffraction signal $I_d$ is
detected. Finally, the intensity $I_0$ of the transmitted beam without coupling is measured. Substituting
$I_e$, $I_d$, and $I_0$ into Eq. (8), we can then calculate the phase shift between the grating and the
interference pattern in the crystal.

C. Determination of Trap Number Density By Phase Shift Measurement

The trap number density $N_{eff}$ of a crystal plays a key role in the photorefractive response
time. Therefore, measuring $N_{eff}$ accurately becomes an important topic in the research on
photorefractive phenomena. We have developed a new technique for the measurement of $N_{eff}$ by
applying a DC field across a crystal and determining the resulting phase shift between the grating
and the fringe pattern.

[Figure: Experimental Apparatus for Simultaneously Monitoring Energy Exchange and Diffraction Efficiency]

[Figure: Sample Holder for Stabilizing Temperature While Applying External Field, with nylon screws labeled]

After applying a DC field across a crystal and measuring $I_e$ and $I_d$, we can calculate both
$\sin\phi$ and $\tan\phi$. In order to get accurate results we can vary the magnitude of the DC field and
measure $I_e$ and $I_d$ for each DC field value $E_0$.

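A second method for measuring $N_{eff}$ is to measure the two-beam coupling gain on a weak beam
$I_1$. The intensity along its propagation direction is given by

$$\frac{I_1(L)}{I_1(0)} = \frac{(1+m)\,e^{\gamma L_{eff}}}{1 + m\,e^{\gamma L_{eff}}} \tag{9}$$

where $m = I_{01}/I_{02}$ is the ratio of the weak beam intensity to the strong beam intensity.
Experimentally, we measured $I_1(L)$, $I_1(0)$, and $m$, and then calculated $\gamma L_{eff}$, where

$$\gamma L_{eff} = R\,\frac{2\pi R_{eff}\,k_B T}{\lambda\,n\cos\theta\;e}\;\frac{K_g}{1 + (K_g/K_0)^2} \tag{10}$$

$R_{eff} = n_e^4 r_{33}\cos^2\theta - n_o^4 r_{13}\sin^2\theta$ for an extraordinary ray, $K_g$ is the grating constant, and

$$K_0^2 = \frac{C\,N_{eff}}{\varepsilon}$$

provided $C = 4\pi e^2/k_B T$, and $R$ is the measure of the electron-hole
competition in the formation of the space-charge field.

By defining $f(\theta) = \cos\theta - \dfrac{n_o^4 r_{13}}{n_e^4 r_{33}\cos\theta}\sin^2\theta$ we will have

$$\frac{\gamma L_{eff}}{f(\theta)} \propto \frac{K_g}{1 + (K_g/K_0)^2}$$

By plotting the data of $\gamma L_{eff}/f(\theta)$ vs $K_g$ and fitting the data, we can then obtain a value of
$N_{eff}/\varepsilon$.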
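
Extracting the gain-length product from a measured intensity ratio requires inverting the weak-beam gain expression. The sketch below assumes the standard lossless two-beam coupling form $g = (1+m)e^{x}/(1+me^{x})$ with $x = \gamma L_{eff}$ and $m$ the weak-to-strong intensity ratio, which rearranges to $e^{x} = g/(1 + m - gm)$. The function names and numbers are illustrative, and absorption is neglected.

```python
import math


def gain_ratio(gamma_L, m):
    """Weak-beam output/input intensity ratio for gain-length product gamma_L."""
    growth = math.exp(gamma_L)
    return (1.0 + m) * growth / (1.0 + m * growth)


def gamma_L_from_ratio(g, m):
    """Invert gain_ratio: recover gamma_L from the measured ratio g and beam ratio m."""
    denominator = 1.0 + m - g * m
    if g <= 0.0 or denominator <= 0.0:
        # g cannot reach (1 + m) / m, the limit in which the pump is fully depleted
        raise ValueError("measured ratio is outside the range allowed by the model")
    return math.log(g / denominator)


# Hypothetical measurement: weak beam at 1 percent of the strong beam.
m = 0.01
g_measured = gain_ratio(0.3, m)          # stands in for a measured I(L)/I(0)
gamma_L = gamma_L_from_ratio(g_measured, m)
print(gamma_L)
```

For a very weak signal beam ($m \to 0$) the expression reduces to $g = e^{\gamma L_{eff}}$, so the correction matters mainly when the two beams are comparable in intensity.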


If electrons and holes occur simultaneously in the charge transport process, there will be an
electron-hole competition in the formation of the space-charge field which will lead to a reduction in the
strength of the photorefractive effect. Suppose both electrons and holes can be excited from (and
to) a single impurity band; then by employing the band-transport model a first-order differential
equation for the time evolution of the space-charge field can be derived. When there is no voltage
applied to the crystal the amplitude of the steady-state space-charge field is

$$E_{sc} = -\,i\,m\,\frac{k_B T}{e}\;\frac{K_g}{1 + K_g^2/K_0^2}\;R \tag{12}$$

where $R = (1-C)/(1+C)$

with

$$C = \frac{s_h N_A\,(K_g^2 + K_e^2)}{s_e\,(N_D - N_A)\,(K_g^2 + K_h^2)}$$

If the charge carriers are only electrons, $C = 0$ and $R = 1$; if the charge carriers are only holes,
$C \to \infty$ and $R = -1$. Therefore, $R = 1$ or $-1$ indicates no electron-hole competition. A plot of $K_g/\gamma$ vs
$K_g^2$ gives us a straight line. The slope and the y-intercept of the line enable us to determine the
electron-hole competition $R$.
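
The straight-line analysis can be made concrete as follows. If the gain has the form $\gamma = A R K_g/(1 + K_g^2/K_0^2)$, then $K_g/\gamma = (1/AR)(1 + K_g^2/K_0^2)$, which is linear in $K_g^2$ with intercept $1/(AR)$ and slope $1/(A R K_0^2)$. The sketch below fits that line by ordinary least squares. It assumes the prefactor $A$ is known from the material constants; the function names and the synthetic data are hypothetical.

```python
def linear_fit(xs, ys):
    """Ordinary least-squares fit y = intercept + slope * x."""
    n = len(xs)
    sx, sy = sum(xs), sum(ys)
    sxx = sum(x * x for x in xs)
    sxy = sum(x * y for x, y in zip(xs, ys))
    slope = (n * sxy - sx * sy) / (n * sxx - sx * sx)
    intercept = (sy - slope * sx) / n
    return intercept, slope


def competition_from_gain(K_g, gamma, prefactor_A):
    """Return (R, K0_squared) from gain data taken at several grating constants."""
    xs = [k * k for k in K_g]                  # abscissa: K_g squared
    ys = [k / g for k, g in zip(K_g, gamma)]   # ordinate: K_g / gamma
    intercept, slope = linear_fit(xs, ys)
    K0_squared = intercept / slope
    R = 1.0 / (prefactor_A * intercept)
    return R, K0_squared


# Synthetic data in arbitrary units, generated with A = 0.5, R = 0.8, K0 = 10.
A_true, R_true, K0_true = 0.5, 0.8, 10.0
K_g = [2.0, 4.0, 6.0, 8.0, 12.0]
gamma = [A_true * R_true * k / (1.0 + (k / K0_true) ** 2) for k in K_g]

R_fit, K0_sq_fit = competition_from_gain(K_g, gamma, A_true)
print(R_fit, K0_sq_fit)
```

Since $K_0^2$ is the ratio of intercept to slope, it is independent of the prefactor; only the value of $R$ relies on knowing $A$. A negative fitted $R$ would indicate hole-dominated transport.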

RESULTS 


During the summer we have

(i) Set up beam coupling apparatus at Hanscom Air Force Base and taken initial data
on several samples.

(ii) Set up two-beam coupling and 4-wave mixing apparatus at Arkansas. This system
is capable of continuous wave and picosecond investigations from 0.88 μm to 1 μm.

(iii) Set up a c.w. two-beam coupling system operating at 1.3 μm at Arkansas.

(iv) Obtained two-beam coupling and 4-wave mixing data at Arkansas. Typical data are
shown below.

(v) Carried out an extensive review of the literature in order to take advantage of all
previous studies on InP.

(vi) Constructed a computerized data acquisition system.

[Figure: Two-Wave Mixing Apparatus Set Up at Rome Laboratory, Hanscom Air Force Base]


From initial results we obtained gains on the order of 0.3 cm$^{-1}$ for applied fields of 5 kV/cm.
Using the parameters of

$n = 3.29$
$\varepsilon = 12.1$

$r_{41} = -1.53$ pm/V
an  =  4  X  10'‘®cm^ 
ap  =  3  X  10'‘*cm^' 

we are now in a position to make measurements on

(i) electron-hole competition
(ii) charge of major carrier
(iii) response time
(iv) gain versus intensity and applied field
(v) diffraction efficiency
(vi) phase of grating
(vii) donor and acceptor concentrations

CONCLUSIONS

We are now in the process of making these measurements and developing models for
comparison. As soon as we develop an understanding of the photorefractive properties of InP
that is comparable to our understanding of the ferroelectrics, we will focus attention on exploring
some novel ideas and applications. These will be explored with Dr. David Bliss of Rome Lab,
Hanscom Air Force Base, during the next year.

The TkWWW Robot: Beyond Browsing


Scott  Spetka 
Assistant  Professor 
Department  of  Computer  Science 
State  University  of  New  York 
Institute  of  Technology  at  Utica/Rome 


Abstract 


The TkWWW Robot extends the functionality of the TkWWW browser. It can also be used as a standalone agent
that processes queries on the WWW. In either case, the robot traverses the web according to functions described
using Tcl extensions developed for the TkWWW browser and the TkWWW robot. Tcl language extensions allow
novice users to direct a search based on key words and HTML data types or to invoke predefined queries. Expert
users can add their own Tcl extensions for use in more sophisticated searches. The system can be quickly adapted to
support new resource types that may become available on the WWW, such as live audio/video connections for
interaction with experts.

When TkWWW robots are dispatched from the TkWWW browser, their results can be used to guide further
browsing. Robots can also be run in the background to build HTML indexes, compile WWW statistics, collect portfolios
of pictures, or perform any other function that can be described by the TkWWW Tcl extensions. Searches can be
restricted according to URL names, limits on search path lengths, etc. The TkWWW robot can interact with its master
by popping up windows to request input according to a Tcl query description. The main advantage that the
TkWWW robot has over existing web spiders and robots is its flexibility in adapting to virtually any criteria used to
guide its search path and control selection of data for retrieval.

The paper presents the architecture and implementation of the TkWWW robot. It describes Tcl extensions
implemented to support TkWWW robot queries. The TkWWW Robot is being developed over the summer at the Air
Force Rome Laboratory with funding from the Air Force Office of Scientific Research.

Introduction 

The global distributed WWW database is growing dramatically, as is the Internet itself. If the WWW
follows a growth pattern similar to the Internet, we can expect an explosion of information and commercial
applications in the near future. The quantity of data available has already prompted the development of techniques for building
indexes to help users direct their search. Many of the index-building applications require exhaustive search of the
net, taking significant time.

Most access to WWW resources today is through interactive browsers. In these systems, a WWW query is
limited to the use of a single pointer to reference a page. The page is retrieved and the user determines whether
further references can provide additional information. WWW public databases are hierarchically structured. The only
dynamically created state information associated with a search is a stack of branch points that can be used to move
forward and backward through a history of pages to effectively search a hierarchically structured information server.

WWW database structures and the tools used for their access must be enhanced in order to provide
effective access to an information database that is growing rapidly. The terms robot and spider
(http://web.nexor.co.uk/mak/doc/robots/robots.html) are used in reference to automated browsing tools for access to
publicly accessible databases on the Internet. A simple robot is described in this paper that is flexible enough to
implement many query processing techniques. The robot is analogous to the access methods of a classical database,
but it provides a much higher level interface which simplifies the implementation of new query processing
techniques in a heterogeneous WWW environment. The paper also discusses the changes in database structure that may
be expected as the current structuring techniques become overloaded by the increasing size of the database.

The first section of the paper discusses the issues that were considered in deciding to build a robot. The
second section describes the rationale for basing the robot on Tcl/Tk and describes its implementation. The third section
examines issues related to whether servers should provide additional functionality. Section 4 discusses research
directions for further experimentation with the TkWWW Robot. The paper concludes with some general comments
from the experience gained in implementing a robot.

Issues  for  Consideration 

Locality  of  Reference 

Robots exploit the locality of reference inherent in the natural organization of the WWW data. Home pages
reflect the main interests of their maintainers. While there is nothing to prevent random links and disorganized
information hierarchies, links are most often used to present sub-topics in accordance with the underlying storage
structure. The notion of locality of reference suggests that a logical neighborhood of a node would have a good chance
of referencing the major sites that store related data. Pages found in browsing the WWW are often linked to pages
presenting related topics. Links can be used to revisit pages that may be changing dynamically.

Improved storage organization would help to reduce the search radius needed to find most of the related
data. Although a dynamic robot search does not guarantee that all desired data is retrieved, there is no way that any
query processor can make that guarantee on today’s WWW. The increasingly dynamic nature of WWW servers,
storing information like live video (http://tns-www.ics.mit.edu/vs/demos.html) and weather information (xmosaic
http://rs560.cl.msu.edu/weather/) affects the ability of indexes to be properly maintained. Autonomous operation of
servers and reliability considerations make it impossible to deliver all possible responses to a query.

Resource Requirements


There is some concern that automated browsing will either be too slow to be effective or will overload
existing Internet communication channels. These are the same concerns that characterize the development of any
advanced research system. Software must be developed on today’s systems to be ready in time to serve the next
generation of computers. The rate of development of Internet resources and the drop in cost of computer processors,
memory and disk all support the assertion that robots will be able to operate at reasonable levels of resource
consumption within a short time.

The  main  questions  for  resource  consumption  are  how  far  it  is  worth  searching  for  a  result  and  whether  the 
result  is  worth  the  expenditure  of  resources  needed  for  the  search.  In  a  single  machine  environment,  a  database 
scheduler  can  determine  priorities  for  processing  to  provide  for  balanced  and  efficient  operation.  Distributed 
database  systems  cooperate  to  provide  similar  scheduling  functionality.  The  growth  of  the  WWW  has  placed  the 
same  type  of  unpredictable  resource  requirements  on  computer  server  implementations  that  have  characterized  the 
Internet  communications  processors  for  years. 

Client/server  Architecture:  Parallelism 

The current client/server architecture implemented in the WWW is based on full autonomy of the server.
Client applications, like Mosaic, TkWWW, and Gopher, request pages from the server which are then processed
locally. This architecture limits the degree of parallel processing that can occur because of the synchronous
nature of the read function. A client may have just a single request outstanding, even if it is waiting on a very slow
server.

There are two approaches to developing increased parallelism. In the first, servers do local processing to
minimize the amount of data that is transmitted. This approach requires that a filter be sent to the server. A natural
extension would be for a more complex query to be sent to the server and for additional steps in the processing to
take place there. A second approach can improve parallelism while having less impact on autonomy. Several
machines in a local network can cooperate to share the I/O burden associated with a query. This approach suffers from
the need to share a common network connection, limiting the degree of parallelism that can be accomplished.

TkWWW Robot - Using Robots on Today's WWW

The prototype TkWWW Robot is the first step in developing a general purpose robot that can be
programmed to provide a query processing interface to users. Users will, in general, be interested in finding particular
types of HTML objects (similar to projection in relational databases), finding HTML objects that satisfy specific
search criteria (relational database selection), and looking for logical relationships between HTML objects scattered
over  the  WWW  database  (relational  database  join).  The  initial  TkWWW  Robot  implementation  projects  all  links 
found  within  a  set  of  WWW  pages. 
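
The relational analogy can be made concrete with a small sketch. The Python below is illustrative only and is not part of TkWWW: the `HtmlObject` record, the sample data, and the function names `project`, `select`, and `join_on_target` are assumptions introduced here to show the three kinds of query over HTML objects, following the paper's use of the relational terms.

```python
from typing import Callable, NamedTuple


class HtmlObject(NamedTuple):
    """One object found on a page (hypothetical record layout)."""
    page: str      # URL of the page holding the object
    kind: str      # e.g. "link", "image", "audio"
    target: str    # URL the object refers to
    label: str     # hypertext identifier or caption


# Hypothetical in-memory stand-in for objects gathered from WWW pages.
OBJECTS = [
    HtmlObject("http://a.example/", "link", "http://b.example/", "Robots"),
    HtmlObject("http://a.example/", "image", "http://a.example/logo.gif", "Logo"),
    HtmlObject("http://b.example/", "link", "http://c.example/", "Spiders"),
    HtmlObject("http://c.example/", "link", "http://b.example/", "Robots again"),
]


def project(objects, kind: str):
    """'Projection' in the paper's sense: keep only objects of one HTML type."""
    return [o for o in objects if o.kind == kind]


def select(objects, predicate: Callable[[HtmlObject], bool]):
    """Selection: keep only objects satisfying a search criterion."""
    return [o for o in objects if predicate(o)]


def join_on_target(left, right):
    """Join: pair objects on different pages that refer to the same target."""
    return [(a, b) for a in left for b in right
            if a.target == b.target and a.page != b.page]


links = project(OBJECTS, "link")
robot_links = select(links, lambda o: "Robots" in o.label)
shared = join_on_target(links, links)
```

Here `links` corresponds to what the initial robot implementation returns (all links in a set of pages), while `robot_links` and `shared` indicate how selection and join queries could be layered on top of it.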

Figure 1 shows the TkWWW Robot browsing interface after a search for links within a radius of 2 hops is
selected from the robot pull-down menu. Notice that the links on the robot home page (shown in the background) are
labeled to show the links that can be expected to be found on the corresponding page. The pop-up dialog box
contains all links found on the pages referenced by the home page in addition to all links found on the home page. Any
of these links can be selected directly for further browsing.

Design and Implementation


The TkWWW Robot is an extension of the TkWWW browser, which is written in Tcl/Tk (URL ref). The
TkWWW browser implements commands in the C language which augment the base Tcl/Tk command language to
provide a convenient interface which can be used to access WWW resources. The TkWWW Robot uses TkWWW
commands along with additional commands that are specifically designed to support the implementation of
autonomous agent data retrieval. The diagram below shows the layering of the TkWWW Robot implementation.
Commands at all levels of the system are used in developing query processing algorithms and automated browsing tools.


Figure 2 - TkWWW Robot System Structure

In order to provide maximum flexibility in implementing additional robot functionality, the additional
commands added for TkWWW Robot should be as simple as possible. The interface implemented in the prototype robot
operates on a URL (WWW page address). It returns all links stored on the page. The links are returned as a character
stream which is easily processed using the powerful string handling commands of Tcl. Figure 3 shows the call to the
HtRobot function, which returns a list of pairs of <URL (page address), hypertext identifier> to the higher-level
Tcl/Tk functions. The HtRobot function is written in the C language. The $page_value parameter specifies the address
of a page to search for links.


set  RobotList  [HtRobot  $page_value] 
Figure 3 - The HtRobot Function Interface

Higher level Tcl functions loop through the address/identifier pairs and produce a display of the
identifiers with appropriate callbacks defined to invoke WWW page access commands when one of the identifiers is
selected. The tkW3RobotDialog function, implemented in the prototype TkWWW Robot, is designed to provide
access to all links within a radius of 1 hop from the current page (another similar function implements the 2 hop access
shown in figure 1 above). The implementation, shown in figure 4, is simplified by the use of the tkW3NavigateRobot
function, which uses the HtRobot interface described above to access the links on the specified page. The list is then
processed to display the links and set up appropriate callbacks for their access. This simple loop builds the pop-up
menu shown in figure 1 above.

proc tkW3RobotDialog {} {
    global tkW3Robot
    set parent .
    set w .robot

    # tkW3NavigateRobot returns a list of links, using HtRobot
    tkW3NavigateRobot

    # Set up the pop-up dialog window (some buttons not shown)
    DLG:toplevel $parent $w
    tkW3OutputMakeButtons $w.button_frame {
        {current "Add current" "tkW3RobotAdd"}
        {edit "Edit Title" "tkW3RobotEditTitle"}
    }

    # Draw box and set up bindings
    pack append $w $w.button_frame top
    DLG:draw_listbox $w {}
    DLG:draw_buttons $w [list "Dismiss" "Help"]
    DLG:bind_button $w 1 "DLG:hide $w"
    DLG:bind_button $w 2 "tkW3HelpNoHelp"

    # Loop through list of page address/description pairs
    # and add them to pop-up dialog box
    set Rcnt 0
    set Rlen [llength $tkW3Robot(list)]
    while { $Rcnt < $Rlen } {
        # The description is the second element of each pair
        $w.list insert end [lindex $tkW3Robot(list) [expr $Rcnt + 1]]
        incr Rcnt 2
    }

    # Display the pop-up box
    DLG:show $parent $w
}

Figure  4  -  Tcl/Tk/TkWWW  Processing  for  a  List  of  Links 
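
For readers who do not use Tcl, the link-extraction and hop-radius ideas can be sketched in Python. The code below is an illustrative assumption, not the TkWWW implementation: `page_links` plays a role loosely analogous to HtRobot (it returns <address, identifier> pairs for one page), `links_within` collects links out to a given radius, and the `PAGES` dictionary is a hypothetical stand-in for network retrieval. Relative URLs are not resolved in this sketch.

```python
from collections import deque
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect (URL, hypertext identifier) pairs from anchor tags."""

    def __init__(self):
        super().__init__()
        self.pairs = []      # finished (href, anchor text) pairs
        self._href = None    # href of the anchor currently open, if any
        self._text = []      # text fragments seen inside that anchor

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self._href, self._text = href, []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.pairs.append((self._href, "".join(self._text).strip()))
            self._href = None


def page_links(html):
    """Return the (address, identifier) pairs found on one page."""
    collector = LinkCollector()
    collector.feed(html)
    collector.close()
    return collector.pairs


def links_within(pages, start, radius):
    """Breadth-first collection of links on pages fewer than `radius` hops away.

    With radius 1 only the start page is parsed; with radius 2 the pages
    it references are parsed as well.
    """
    found, seen = [], {start}
    queue = deque([(start, 0)])
    while queue:
        url, depth = queue.popleft()
        if depth >= radius or url not in pages:
            continue
        for href, label in page_links(pages[url]):
            found.append((href, label))
            if href not in seen:
                seen.add(href)
                queue.append((href, depth + 1))
    return found


# Hypothetical pages keyed by address.
PAGES = {
    "home": '<a href="a">Page A</a> <a href="b">Page B</a>',
    "a": '<a href="c">Page C</a>',
    "b": '<a href="home">Home</a>',
    "c": '<a href="d">Page D</a>',
}

one_hop = links_within(PAGES, "home", 1)
two_hop = links_within(PAGES, "home", 2)
```

The `seen` set keeps the search from parsing a page twice when links form a cycle, which is one simple way to bound the search path length.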
Implementation  Problems 

The current structure of the system made it somewhat difficult to isolate the links that were found on each
HTML page. The functions for parsing and generation of the bitmap display are integrated rather than being
performed separately. The current implementation identifies links during the parsing process and copies them into a
global list structure for use in the HtRobot interface function. The call to the parser is implemented in the same way
as a call to display a page. Page display functions accept the URL address of a page as a parameter but default to the
current page when none is given.

The TkWWW Robot: a Distributed Database Query Processor


Eventually, WWW will support distributed query processing with parallel execution of server functions to
support queries generated by client programs that provide the parameters for a dynamic search of the WWW.
Today’s concern over limited processing resources and bandwidth will be reduced as the resources become available in
the Internet, or its successor, to support the ideas and software being developed in research laboratories today.
The addition of local processing resources to provide adequate query processing support will be driven by the
need for companies to compete commercially. Providing convenient query processing support will be comparable to
providing a facility where people can shop. In addition to ordering products from on-line hypertext catalogs, the
WWW provides a market for research ideas where the acceptance of ideas may depend on how easily they can be
located.

Database  Structure 

If we assume that the WWW is moving in a commercial direction, then the perspective of the WWW
database designer should be oriented toward a structure that is general enough to provide the information that would
be expected, in a standard format, rather than supporting a totally unstructured information system. The hypertext
page is analogous to the tuple in relational database systems. Pages could be linked together to form an IMS-like
database system. A more appropriate model for WWW databases is an object-oriented database structure. The
WWW access methods (page retrieval) are already designed to access pages of varying formats. Additional structure
is required to support identification of attributes from base classes which define the basic structures which can be
used in sub-classes that use them in their definition. Such a definition could be used for accessing a personal URL
type page. Each page should provide a core of information (in this case the name, company, and email address)
associated with the individual.

Access Methods


Data that is made available on the WWW will be more efficiently retrieved through robots which
implement parallel exploration of access paths to the data required. A search for a product will be successful if data is
readily accessible (see above) and if an appropriate search engine or robot is in use. In the case of the TkWWW
Robot prototype, described above, the support for distributed processing would retrieve lists of <address, title> pairs
for the links known to the server. Either all links or only those at a particular hop count from the entry point would
be retrieved. The server would, with a minor increase in local processing, significantly reduce network traffic.

The TkWWW browser operates on the HTML retrieved from remote servers and performs all operations locally. It
would seem reasonable that simple search functions be added to servers that would allow search operations to use
remote processing resources in order to reduce the impact on the network at a server. A server-based search
capability would allow a robot or browser to return all links stored at a site, rather than retrieving all pages, then throwing
away all data except the links.

Indexing 

Indexing techniques are commonly used in database systems that are highly structured. Many static indexes
have been developed for access to WWW resources. They are mainly produced through exhaustive search
techniques. The indexes are not updated upon changes to the objects they point to, as they would be in a standard
database system. This is due in part to an effort to maintain the autonomy of servers. Robots, like the W4 Worm, cannot
insist that the objects that they put into their indices report back to them upon changes in their status. Increasing
the frequency of exhaustive search would place a large burden on the network in order to avoid a few missed references.
Good indexes exist on the WWW for convenient search of many resources, by name, subject, etc.

The only mechanism that supports user controlled static index construction for today’s browsers requires
manually inserting pointers to pages into a bookmark list. In most cases, bookmarks are unrelated and are used as
starting points for independent searches.

Future  Robots:  Using  Robots  on  Tomorrow's  WWW 

Eventually, the WWW must move away from a model where all decisions regarding WWW navigation are
made by the local browser/client/robot, based on remote information retrieval. Current robots have no choice but to
retrieve links from remote sites as a basis for their navigation. The TkWWW, with its flexible implementation, is
poised to change as the WWW moves in the direction of increased intelligence in database servers. This section
describes some of the changes that will force WWW designers to address the issues involved in increasing the
functionality in WWW servers and clients.

Real-Time  Video  Communication 

Much of what has driven the evolution of the WWW involves integration of multimedia databases through
a high-level object-oriented interface. Many systems of the future will have real-time constraints and will include
videotelecommunications as an integral component. The TkWWW robot is poised to meet the challenges of these
new environments. In addition to having real-time constraint requirements, future servers will be characterized by
increasingly dynamic behavior. Servers may offer buttons to initiate dynamic videoconferences based on the
instantaneous availability of an appropriate research expert or salesman, depending on the organization offering the
connection.


Searching for sources of live video provides the WWW analog of a channel scanner where each WWW
page may provide a source of live video. Robot starting points, similar to the "bookmark" capability in today’s
browsers, give a robot a set of initial locations for a search. This functionality is already implemented in the
TkWWW Robot. A robot may find all experts in an area that are prepared for a videoconference. In a similar way,
real-time video may be used for talking to sales and technical representatives. Information services, analogous to
touchtone hierarchy traversal for telephone-based systems, already exist, without the ability to establish a direct
videoconference connection.

A  typical  TkWWW  Robot  query  of  the  future  (with  a  natural  language  interface): 

"Go  find  someone  to  talk  to  and  negotiate 
a  price  of  less  than  $X  for  the  conversation" 

Individual  Databases  and  Servers 

Robots are useful to support the flexibility of personal data representation and communication, helping
people communicate by integrating network, video, and multimedia databases into modern information systems. They
can perform automated answering, filtering and customizing messages received through a communication system,
depending on the authorization of a caller. A robot can actively pursue information on behalf of its master by finding
servers that have answers to questions and retrieving the appropriate data.

Advanced  Functionality 

The use of a Tcl-based robot offers client-agent robots the potential for communication and cooperation
with distribution robots. EXPECT extensions to the basic Tcl command set allow discrimination of requests
and can be used to enhance both the TkWWW browser and the TkHttpd. Security would not be a problem since the
browser would control interaction with the server. A server (httpd) robot may query the client database to determine
areas where a search could provide additional information and avoid unnecessary overhead needed to return data that
is already available. This suggests that data should be structured as HTML in a hierarchic database that is
understood by the httpd.

Future  Research 

The TkWWW prototype demonstrated the feasibility of implementing a robot. Extensions will be needed to
develop support for additional capabilities described in the paper. In addition to handling the basic link objects, the
robot will be upgraded to retrieve other types of HTML objects. New TkWWW Robot
commands to search for objects independent of type, perhaps based on size or some other attribute, would also be
interesting. A command that searches for a logical combination of given parameters can search pages that satisfy
expressions like (mpeg or jpeg or gif).
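
A logical-combination search of this kind might look like the following Python sketch. It is a hypothetical illustration rather than a proposed TkWWW command: the function name `page_matches_any` and the sample `PAGES` data, which summarizes each page by the kinds of objects it holds, are assumptions made here.

```python
def page_matches_any(object_kinds, wanted):
    """True if the page holds at least one object whose kind is in `wanted`."""
    return any(kind in wanted for kind in object_kinds)


# Hypothetical pages, each summarized by the kinds of objects it contains.
PAGES = {
    "http://a.example/": ["link", "gif", "link"],
    "http://b.example/": ["link", "audio"],
    "http://c.example/": ["mpeg", "jpeg"],
}

# Equivalent of the expression (mpeg or jpeg or gif).
hits = [url for url, kinds in PAGES.items()
        if page_matches_any(kinds, {"mpeg", "jpeg", "gif"})]
```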

More complex commands may search pages based on the relationship of objects on a page. A low-level
TkWWW Robot command to return the next object on a page would allow queries like "get the addresses of all pages
that have mpegs immediately followed by audios", i.e., queries where the ordering of objects is important. In general
the  Tcl  functions  will  determine  the  characteristics  that  are  sought  on  the  objects.  For  example  "find  pages  that  have 
an  mpeg  followed  by  at  least  3  audio  files  where  the  sizes  of  audio  files  are  increasing"  would  indicate  how  many 
WWW  database  managers  sort  their  files. 
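
The two ordering queries quoted above can be sketched in Python as follows. This is only an illustration under stated assumptions: a page is modeled as an ordered list of hypothetical `(kind, size)` tuples, the function names are invented here, and "followed by" is read as "immediately followed by a run of consecutive audio objects".

```python
def mpeg_then_audio(objects):
    """True if some mpeg is immediately followed by an audio object."""
    kinds = [kind for kind, _size in objects]
    return any(a == "mpeg" and b == "audio" for a, b in zip(kinds, kinds[1:]))


def mpeg_then_growing_audio(objects, minimum=3):
    """True if an mpeg is followed by at least `minimum` consecutive
    audio objects whose sizes are strictly increasing."""
    for i, (kind, _size) in enumerate(objects):
        if kind != "mpeg":
            continue
        run, previous = 0, None
        for next_kind, size in objects[i + 1:]:
            if next_kind != "audio" or (previous is not None and size <= previous):
                break
            run += 1
            previous = size
        if run >= minimum:
            return True
    return False


# Hypothetical pages: ordered (kind, size in kilobytes) tuples.
sorted_page = [("mpeg", 900), ("audio", 10), ("audio", 20), ("audio", 35), ("gif", 5)]
unsorted_page = [("mpeg", 900), ("audio", 30), ("audio", 20), ("audio", 40)]
```

Both pages satisfy the first query, but only `sorted_page` satisfies the second, which is the kind of distinction the next-object command would make possible.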

There  are  several  other  research  issues  related  to  the  operation  of  WWW  servers  that  should  be  addressed  to 
improve  the  general  efficiency  of  Internet  robots.  Other  issues  involve  commercializing  the  WWW.  Robots  must  be 
able  to  avoid  servers  that  would  charge  for  their  access.  The  current  approach  to  limiting  robot  access,  through  the 
installation  of  an  advisory  file  which  requests  that  robots  refrain  from  access  can  be  amplified  to  provide  them  with 
the equivalent of a toll booth where desired by the server operator. Some of the appropriate structure may depend on
the  rate  structure  that  develops  as  NSFnet  is  turned  over  to  commercial  providers. 
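
As a minimal sketch of how a robot might honor such an advisory file, assuming it follows the robots exclusion convention (a robots.txt file), Python's standard `urllib.robotparser` module can be used. The file contents, the user-agent name `TkWWWRobot`, and the example URLs below are hypothetical; the toll-booth extension discussed above is not part of that convention and is not modeled here.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical advisory file, as a server operator might publish it.
ADVISORY = """\
User-agent: *
Disallow: /private/
"""

parser = RobotFileParser()
parser.parse(ADVISORY.splitlines())   # parse the rules without any network access

# Ask whether a robot with the given (assumed) name may retrieve each page.
allowed = parser.can_fetch("TkWWWRobot", "http://www.example.com/index.html")
blocked = parser.can_fetch("TkWWWRobot", "http://www.example.com/private/data.html")
```

A robot would make this check before each retrieval and skip any address for which the answer is negative.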
Conclusion


The TkWWW robot can be programmed to perform most of the functions of other robots that are in
common use. The work being done on the robot is focused on providing additional low level functionality and
developing techniques to implement indexes based on the current content of databases of interest. It seems likely that
additional types of data will continue to be integrated into the WWW framework. The TkWWW can adapt quickly to
changes in the environment, making it a reasonable target for future development.

The ability to develop advanced robot functionality depends on whether server functions can be developed
to help robots do their work efficiently. If initial experimentation with robots is successful, additional server
functionality seems inevitable. Servers that do not support robot access will be overlooked, causing a great deal of effort
invested in their development and maintenance to be of limited value.

Abstract

Experimental  measurements  of  the  bistatic  radar  scattering  from  a  wall  of  deciduous 
trees  have  been  performed.  The  coherent  like-  and  cross-polarization  scattered  fields  were 
measured  for  both  vertical  and  horizontal  transmit  polarization,  allowing  the  calculation 
of  the  full  Mueller  scattering  matrix.  The  transmitter  location  was  fixed  throughout  the 
experiment,  while  the  receive  position  was  varied  approximately  along  an  arc  around  the 
illuminated  section  of  trees.  The  illumination  elevation  angle  was  downward  at  6  degrees 
while  the  measured  receive  elevation  angles  varied  from  approximately  0  degrees  to  6 
degrees downward.

I. INTRODUCTION


The  ability  of  a  radar  to  detect  a  target  is  often  limited  by  the  clutter  returns  from 
vegetation canopies. One method to improve the detection of a target in a clutter background
is to use polarimetric target-clutter discrimination techniques (Eaves and Reedy,
1987).  These  techniques  use  both  the  like-  and  cross-polarized  scattering  characteristics 
of  the  target  and  clutter,  thereby  yielding  more  information  for  the  detection  algorithm. 
Another  method  is  the  use  of  bistatic  radar  systems,  where  the  transmitter  and  receiver  are 
not  co-located.  This  method  is  particularly  useful  in  detecting  targets  that  are  designed 
to  minimize  the  scatter  in  the  backscatter  direction.  For  these  techniques  to  be  used  most 
effectively  the  bistatic  polarimetric  scattering  characteristics  of  the  clutter  as  well  as  the 
target  must  be  well  described. 

Experimental  measurement  of  the  bistatic  scattering  from  a  wall  of  deciduous  trees  has 
been  performed.  Full  polarimetric  scattering  was  measured  and  the  effects  of  the  Stokes 
vectors  of  the  radar  antennas  were  calibrated,  allowing  determination  of  the  full  Mueller 
scattering  matrix  of  the  tree  clutter  patch.  The  experiment  was  performed  at  the  Rome 
Laboratory-Hanscom  AFB  Ipswich  test  site  with  William  Stevens  (Rome  Laboratory)  and 
David  McLaughlin  (Northeastern  University). 

II. EXPERIMENTAL HARDWARE


A.  Radar  system 

The  radar  system  used  is  described  by  Rao  et  al.  (1991).  The  operating  frequency 
was 3.2 GHz, and the CW transmitted power was 3 W. Range resolution was obtained
using  a  1023  bit  binary  phase  code  modulated  at  200  Mbits/sec,  yielding  a  1.5  m  one-way 
range  resolution.  All  signals  within  the  transmitting  unit  were  derived  from  the  same 
stable  5  MHz  crystal  oscillator.  The  transmitter  provides  a  single  channel,  so  transmitted 
polarization was controlled by an external switch.

The receiver unit was completely independent of the transmitter, and was therefore
clocked to a different stable crystal oscillator. Coherence was maintained by phase-locking
the receiver to the direct feed-through signal from the transmitter. The receiver generates
a binary phase code that is identical to the transmitted code. Range compression of the
received signal is performed using a Costas-loop processor (Holmes, 1990). The range
side-lobes resulting from this technique limit the system signal-to-noise ratio to 30 dB.
The equivalent output pulse repetition period (PRP) of the Costas loop was 20 ms (pulse
repetition frequency of 50 Hz), indicating that clutter fluctuations of up to 50 Hz can
be measured. The range cells were output from the Costas loop at a 615.66498 kHz
clock rate. The receiver included two channels that were configured for simultaneous
measurement of the vertical and horizontal polarization scattered field. Each channel
provided  the  inphase,  quadrature,  and  magnitude  signals  after  range  compression,  and  full 
coherence  was  maintained  so  the  relative  phase  between  the  two  scattered  polarizations 
was  conserved. 
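
The waveform and timing figures quoted in this section (chip rate, code length, and PRP) can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration, not the authors' processing code: it assumes free-space propagation, takes the one-way resolution to be the path length covered during one code chip, and uses variable names chosen here for convenience.

```python
# Sanity check of the phase-coded waveform parameters quoted in the text.
# Assumptions (not from the report): free-space propagation, and one-way
# resolution taken as the path length covered during one chip.

C = 299_792_458.0        # speed of light, m/s
CHIP_RATE = 200e6        # code modulation rate, chips/s (200 Mbits/sec)
CODE_LENGTH = 1023       # chips per code period
PRP = 20e-3              # output pulse repetition period, s

chip_duration = 1.0 / CHIP_RATE                       # 5 ns per chip
one_way_resolution = C * chip_duration                # about 1.5 m
code_period = CODE_LENGTH * chip_duration             # about 5.1 microseconds
unambiguous_path = CODE_LENGTH * one_way_resolution   # path length per code period
prf = 1.0 / PRP                                       # output sample rate per range cell

print(f"one-way resolution: {one_way_resolution:.2f} m")
print(f"code period:        {code_period * 1e6:.3f} us")
print(f"unambiguous path:   {unambiguous_path:.0f} m")
print(f"output PRF:         {prf:.0f} Hz")
```

Under these assumptions the computed resolution and PRF agree with the 1.5 m and 50 Hz values stated above.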

The  receiver  includes  an  automatic  gain  control  (AGC)  circuit  that  sets  the  peak 
received  signal  to  a  safe  level  below  the  receiver  saturation  point.  The  radar  was  calibrated 
for  full  measurement  of  the  target  Mueller  scattering  matrix  using  the  technique  of  Mead 
and  McIntosh  (1990). 
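
Because both channels are coherent, each pair of I/Q samples fixes the polarization state of the scattered field. The sketch below shows one way this could be expressed as Stokes parameters. It is a minimal illustration under stated assumptions: the function name is hypothetical, the sign convention is one common choice rather than the one used by the authors, and no calibration (such as the Mead and McIntosh technique) is applied.

```python
# Minimal sketch: Stokes parameters of the scattered field from the I/Q
# samples of the two receiver channels. The sign convention below is one
# common choice and is an assumption, not taken from the report.

def stokes_from_iq(i_v, q_v, i_h, q_h):
    """Return (S0, S1, S2, S3) for one coherent sample pair."""
    e_v = complex(i_v, q_v)          # vertical-channel complex voltage
    e_h = complex(i_h, q_h)          # horizontal-channel complex voltage
    cross = e_v * e_h.conjugate()    # carries the relative phase between channels
    s0 = abs(e_v) ** 2 + abs(e_h) ** 2
    s1 = abs(e_v) ** 2 - abs(e_h) ** 2
    s2 = 2.0 * cross.real
    s3 = 2.0 * cross.imag
    return s0, s1, s2, s3

# Hypothetical sample: equal amplitudes, H channel leading V by 90 degrees.
s0, s1, s2, s3 = stokes_from_iq(1.0, 0.0, 0.0, 1.0)
print(s0, s1, s2, s3)
```

A single coherent sample is always fully polarized (S0 squared equals the sum of the squares of the other three); estimating a Mueller matrix for fluctuating clutter would require averaging such vectors over many samples for each transmitted polarization.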

C. Antennas

The  transmitting  antenna  was  a  Seavey  Engineering  Associates  Model  AS48-33NA 
dual  polarized  feed  4  foot  dish  antenna.  The  measured  gain  at  both  polarizations  was  28.6 
dB  over  isotropic  with  3-dB  beamwidths  of  5.5  degrees  in  both  planes.  The  highest  sidelobe 
level (-19.3 dB) occurred in the vertical polarization, H-plane pattern. Cross-polarization
isolation  was  better  than  -30  dB.  The  transmitted  signal  was  switched  between  the  two 
feed  ports  using  an  external  ferrite  switch. 
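
As a rough plausibility check, the quoted gain and beamwidth can be compared with ideal circular-aperture formulas. The sketch below is illustrative only: the 70 λ/D beamwidth rule of thumb and the derived aperture efficiency are assumptions introduced here and do not appear in the report.

```python
import math

# Consistency check of the quoted dish parameters against ideal-aperture
# formulas. The 70*lambda/D beamwidth rule of thumb and the derived
# efficiency are illustrative assumptions, not values from the report.

C = 299_792_458.0            # speed of light, m/s
FREQ_HZ = 3.2e9              # operating frequency
DIAMETER_M = 4 * 0.3048      # 4 foot dish, in meters
MEASURED_GAIN_DBI = 28.6     # measured gain quoted in the text

wavelength = C / FREQ_HZ
ideal_gain = (math.pi * DIAMETER_M / wavelength) ** 2   # 100 percent efficient aperture
ideal_gain_dbi = 10 * math.log10(ideal_gain)
efficiency = 10 ** ((MEASURED_GAIN_DBI - ideal_gain_dbi) / 10)
beamwidth_deg = 70 * wavelength / DIAMETER_M

print(f"wavelength:         {wavelength * 100:.2f} cm")
print(f"ideal gain:         {ideal_gain_dbi:.1f} dBi")
print(f"implied efficiency: {efficiency:.2f}")
print(f"approx. beamwidth:  {beamwidth_deg:.1f} deg")
```

Under these assumptions the implied aperture efficiency is a little over 40 percent and the estimated beamwidth is close to the measured 5.5 degrees, so the quoted figures are mutually consistent for a dish of this size.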

The transmit antenna was positioned on a tripod that included a calibrated elevation-over-azimuth
positioner. The tripod was placed on a level Worthington positioner that
is located on the roof of a building at the Rome Laboratory's Ipswich test site, as shown
in Figure 1. As configured, the antenna center was 45 feet above ground level. The
Worthington positioner was not moved throughout the experiment. A narrow-view video
camera was attached to the transmitting antenna for bore-sighting. The camera was
aligned at the beginning of the experiment by maximizing the direct feed-through to the
receiver antenna. Alignment was reconfirmed at the end of the experiment.


Figure  1.  Transmitting  antenna  at  Rome  Laboratory  Ipswich  test  site. 


The receiver antenna was a dual polarized, wide band conical horn. Calibrated measurements
of the gain, beamwidth, and cross-polarization characteristics were not completed
at the end of the summer fellowship; however, simple field measurements of the
direct feed-through signal and comparison with a standard gain horn receive antenna indicated
that the cross-polarization isolation was better than 20 dB and the gain and beamwidths
were consistent with those expected from the 5 1/2 inch antenna aperture.

The receive antenna was mounted directly to an uncalibrated elevation-over-azimuth
positioner. The positioner was mounted directly to two levels of scaffolding, which were
in turn placed on the roof of a site truck, as shown in Figures 2 and 3. The antenna was
approximately 23 feet above ground level. The truck was moved to each measurement
position and was stabilized with four hydraulic jacks. The positioner was then leveled and
visually boresighted at the target area along a straight edge that was parallel to the peak
of the antenna beam.

Figure 3. Receive and transmitting antennas.


D. Data Acquisition

The  inphase,  quadrature,  and  magnitude  signals  at  both  polarizations  were  digitized 
and  stored  directly  to  a  hard  disk.  The  AGC  setting  of  the  receiver  was  also  digitized 
and  recorded.  The  signals  were  recorded  continuously  for  30  s  at  each  receiver  position 
and transmitter polarization. At least two redundant data acquisition runs were made for
each unique transmitter/receiver configuration. Because of the short sample time, runs that
were made during low wind speed periods were repeated, thereby maximizing the number
of  independent  samples  available  for  estimating  the  scattering  coefficients. 

III.  EXPERIMENTAL  LAYOUT 

The positions of the transmitter antenna, target clutter patch, and the receiver locations
are shown in Figure 4. The target clutter patch was a small outcropping of trees,
shown  in  Figure  5.  The  antennas  were  boresighted  at  a  point  6  m  above  the  ground  at 
the  edge  of  the  outcropping  and  about  8  m  below  the  tree  tops.  (The  boresight  point  was 
marked  by  a  ribbon  that  is  not  visible  in  the  Figure.)  The  distances  from  the  target  clutter 
patch to the individual receiver positions and the angular position of the receiver locations
(with  respect  to  the  transmitter  locations)  are  given  in  Table  1.  Because  of  the  unevenness 
and  general  slope  of  the  terrain  (roughly  3  degrees,  with  the  lowest  point  near  lower  left 
of  Figure  4  and  the  highest  point  at  the  upper  right),  the  absolute  receiver  antenna  height 
varied  with  position.  Thus,  the  elevation  look  angle  of  the  receiver  antenna  is  different  at 
each  position  (also  summarized  in  Table  1). 

The  transmitting  antenna  was  63.7  m  from  the  target  clutter  patch  and  pointed  down 
in  elevation  at  6°  when  boresighted  on  the  target  point. 
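
The layout described in this section can be reconstructed in plan view from the Table 1 geometry. The sketch below is a minimal illustration under stated assumptions: distances are treated as horizontal ground distances, all azimuth angles are taken to open to the same side of the transmitter line, position 12 is taken as 110 degrees (following the 10-degree spacing of its neighbors), and the function and variable names are hypothetical.

```python
import math

# Sketch: reconstruct the plan-view layout from Table 1, with the clutter
# target at the origin and the transmitter on the +x axis.
# Assumptions (not stated in the report): distances are horizontal ground
# distances and all azimuth angles open to the same side.

TX_DISTANCE_M = 63.7   # transmitter to target clutter patch

# (position number, azimuth angle from transmitter in degrees, distance in m)
TABLE_1 = [
    (1, 0, 51.8), (2, 10, 56.4), (3, 20, 49.7), (4, 30, 52.1),
    (5, 40, 51.5), (6, 50, 50.6), (7, 60, 49.1), (8, 70, 49.1),
    (9, 80, 48.5), (10, 90, 48.9), (11, 100, 48.8), (12, 110, 49.1),
    (13, 120, 50.3), (14, 130, 44.2), (15, 140, 39.6),
]

def receiver_xy(azimuth_deg, distance_m):
    """Plan-view coordinates of a receiver position relative to the target."""
    a = math.radians(azimuth_deg)
    return distance_m * math.cos(a), distance_m * math.sin(a)

transmitter_xy = (TX_DISTANCE_M, 0.0)
layout = {n: receiver_xy(az, d) for n, az, d in TABLE_1}

for n, (x, y) in layout.items():
    print(f"position {n:2d}: x = {x:6.1f} m, y = {y:6.1f} m")
```

In this frame, position 1 lies on the transmitter line (a near-backscatter geometry) and the positions beyond 90 degrees fall on the far side of the target from the transmitter.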

Figure 4. Receiver, transmitter, and clutter patch positions. (Legend: tree line; + clutter target; O transmitter position; X receiver positions.)

IV. RESULTS

Data collection efforts were just being completed during the last week of the summer
fellowship. Data reduction and preparation for analysis are currently being performed by
Rome Laboratory personnel, so no results are available for this report.

Figure 5. Target clutter patch.

Receiver    Azimuth Angle       Distance From    Elevation
Position    From Transmitter    Target Area      Look Angle
Number      (degrees)           (m)              (degrees)

 1            0                 51.8             -0.6
 2           10                 56.4             -0.4
 3           20                 49.7             -1.0
 4           30                 52.1             -1.3
 5           40                 51.5             -1.9
 6           50                 50.6             -2.1
 7           60                 49.1             -2.5
 8           70                 49.1             -3.5
 9           80                 48.5             -4.0
10           90                 48.9             -5.0
11          100                 48.8             -5.0
12          110                 49.1             -5.5
13          120                 50.3             -6.0
14          130                 44.2             -5.6
15          140                 39.6             -5.7

Table  1.  Receiver  positions  used.  The  angular  position  is  given  as  the  angle  between  the 
line  from  the  transmitter  to  the  clutter  target  and  the  line  from  the  receiver  to  the 
clutter target.


TRANSFERRING TECHNOLOGY VIA THE INTERNET


Rolf  T.  Wigand,  Slawomir  J.  Marcinkowski  and  John  Carlo  Bertot 
School  of  Information  Studies 
Syracuse  University 

Abstract 

The current global economic climate is such that a nation acquires and maintains its wealth, prosperity,
and  strength  predominantly  through  trade.  It  is  necessary,  but  no  longer  sufficient,  for  a  nation  to  possess 
a  strong  military  function  in  a  global  marketplace.  This  new  emphasis  on  national  competitiveness  in  trade 
places  Rome  Laboratory,  as  well  as  other  federal  laboratories,  at  an  important  crossroads.  On  the  one 
hand,  a  military  advantage  requires  continual  technological  superiority.  On  the  other  hand,  Rome 
Laboratory  needs  to  facilitate  national  economic  development  through  the  transfer  of  its  technologies  into 
the  marketplace.  These  developments  serve  to  highlight  the  importance  of  a  proactive  technology  transfer 
process  within  Rome  Laboratory.  However,  a  proactive  operation  is  difficult  to  put  into  action  without 
forthcoming  budgetary  and  personpower  increases.  This  study  focuses  on  a  low-cost  alternative:  to  use 
MOSAIC  on  the  Internet  and  the  World  Wide  Web  to  promote  the  transfer  and  commercialization  of 
technology. An electronic system was developed that gives the private sector, as well as
Rome Laboratory and other Air Force and public sector users, access to various technology transfer
information and databases. This report describes these efforts, the underlying reconceptualizations, and the design and implications
of the electronic system.


Introduction 

The  current  global  economic  climate  is  such  that  a  nation  acquires  and  maintains  its  wealth,  prosperity, 
and  strength  predominantly  through  trade.  It  is  necessary,  but  no  longer  sufficient,  for  a  nation  to  possess 
a  strong  military  function  in  a  global  marketplace.  This  new  emphasis  on  national  competitiveness  in  trade 
places  Rome  Laboratory  (RL),  as  well  as  other  federal  laboratories,  at  an  important  crossroads.  On  the  one 
hand,  a  military  advantage  requires  continual  technological  superiority.  On  the  other  hand,  RL  needs  to 
facilitate national economic development through the transfer of its technologies into the marketplace.
These  developments  serve  to  highlight  the  importance  of  a  proactive  technology  transfer  operation  within 
RL.  However,  a  proactive  operation  is  difficult  to  put  into  action  without  forthcoming  budgetary  and 
personpower  increases.  This  study  attempts  to  identify  a  low-cost  alternative:  to  use  the  Internet  to 
promote  the  transfer  of  technology. 

RL  is  currently  positioning  itself  to  meet  the  challenges  precipitated  by  the  emphasis  on  the  technology 
transfer  process.  This  process,  if  viewed  as  a  life  cycle,  minimally  contains  the  following  components: 
Technology  research 
Technology  development 
Technology  announcement 
Technology  marketing 
Fulfilling  technology  information  requests 
Making  the  technology  available  for  outside  use 
Technology  utilization,  and 
Technology  commercialization. 

In  addition,  technology  transfer  occurs  within  political  and  social  climates  that  may  impose  certain 
constraints.  The  overriding  condition  in  all  of  these  considerations  is,  of  course,  first  of  all  the  Air  Force 
mission. In that sense technology transition within the Air Force is of primary importance, followed by
technology transfer to the private sector, hopefully resulting in technology commercialization. The latter
term  incorporates  such  desirable  events  as  successful  technology  transfer,  technology  utilization, 
profitability  of  the  technology  when  marketed  by  a  firm,  hopefully  resulting  in  the  creation  of  jobs,  adding 
value,  contributing  to  the  regional  and  national  economies,  and  adding  overall  to  the  Gross  Domestic 
Product. 

Many of these technology transfer tasks are mandated and specified by legislation such as the Stevenson-Wydler
Act, the Technology Transfer Act, and others. There are also certain limitations and constraints
on the technology transfer process, such as export control regulations intended to help only U.S.-owned
companies.

Technology  transfer  deliverables  could  be  classified  as:  products,  knowledge,  advice,  know-how,  and 
facilities  use.  These  deliverables  are  not  unique  to  RL,  but  can  be  envisioned  to  apply  to  all  federal 
laboratories. They can be broken down into a set of specific and clearly definable technology transfer
activities, which may include the following:

Cooperative  Research  and  Development  Agreements  (CRDAs) 

Educational  Partnerships 

Exhibits 

Conferences 

Presentations  (by  the  Technology  Transfer  Office  or  scientists  and  engineers) 

White  papers 

Information exchange

Inquiries  (face-to-face,  by  phone,  written) 

Publications 
Advertising 
Public  relations 

Communication  with  company  representatives 
Electronic dissemination
Others.


The  present  focus  for  this  study  is  on  just  one  of  the  above  activities,  electronic  dissemination  of 
information  about  technology. 

The  authors  reviewed  the  technology  transfer  process  at  RL  and  revisited  this  process  by  reconceptualizing 
the  underlying  assumptions  pertaining  to  basic  principles  of  marketing,  diffusion,  information  retrieval  and 
networking.  The  key  driver  of  this  reconceptualization  was  that  of  a  user  or  target  audience  orientation. 
From  this  perspective  a  model  was  developed  that  provides  the  desired  electronic  linkage  between  RL  and 
targeted  private  sector  firms. 

Reconceptualization 

Four  different  approaches  to  a  reconceptualization  of  the  technology  transfer  process,  especially  when  this 
process  occurs  electronically,  were  advanced.  Each  is  described  below: 

A. Marketing. All marketing efforts are based on the basic premise that there is a specific audience of
consumers. This is a set of specific firms within an industry or the private sector, which have needs that
can be filled by firms operating within a specific market. In this case, RL constitutes such a firm that
can  fill  the  demand  for  specific  technologies  that  can  be  marketed  and  commercialized  by  private  sector 
firms. 

One  can  identify  three  main  policies  of  orientation  within  the  marketing  effort:  customer  orientation, 
product  orientation  and  profit  orientation.  The  latter  orientation  is  not  an  appropriate  orientation  for  RL. 
A  product  orientation  is  predicated  on  the  view  that  consumers  will  recognize  and  appreciate  products  for 
superior merit and bestow their patronage on firms; in this case the product is research and development.
In the present case, though, it must be realized that, in general, this product is not designed by RL having
private  sector  firms  in  mind  foremost.  Rather,  the  key  customer  is  the  Air  Force  and  the  product  and  the 
product  orientation  are  determined  by  the  Air  Force  mission.  An  exclusive  product  orientation  effort  is 
therefore  not  wise  for  RL,  but  the  product  certainly  determines  a  potential  market  that  can  be  tapped 
within  the  private  sector.  It  is  therefore  apparent  that  an  appropriate  focus  for  RL  in  terms  of  marketing 
orientation is a customer orientation.
In this setting, a customer orientation denotes (a) an attitude and a pattern of conduct, as well as (b) the
extent  to  which  RL  tries  to  determine  what  its  customers  want  and  then  gives  them  what  they  want. 
Granted,  it  is  often  difficult  and  complex  to  discover  what  the  customer  wants.  For  example,  in  such 
efforts  one  recognizes  that  customer  preferences  vary  widely  and  that  customers  want  something  only 
once they are given concrete choices, i.e., they are incapable of conceiving possible new products.
Potential  customers  often  also  request  features  that  are  incompatible  with  product  capabilities. 

The  basic  challenge  faced  by  RL  is  to  identify  those  needs  and  provide  a  linkage  between  RL  and  the 
customer, i.e., a firm interested in a specific RL technology. To maximize this linkage and relationship to
customers,  RL  needs  to  form  hypotheses  and  understanding  about  these  present  and  future  potential 
customers.  Included  should  be  questions  such  as: 

What  kinds  of  things  affect  customer  behavior? 

Through  which  channels  (advertising,  face-to-face  contacts,  publications,  etc.)  can  customers  be 
reached  by  RL? 

What  is  the  degree  or  strength  of  need  or  desire  for  the  product? 

What  are  the  appropriate  appeals  (or  arguments)  to  which  customers  are  most  responsive? 
What is the customer's responsiveness to different types of sales devices, i.e., their ability to be
influenced  by  technology  transfer  discussions  by  the  Technology  Transfer  staff,  engineers 
and  scientists? 

After these questions have been answered, the marketing dimension entails five general activities:

1. Identifying and selecting the type of customer that RL chooses to cultivate and learning that set
of  firms'  needs  and  desires. 

2.  Designing  products,  know-how  and  services,  to  include  facilities,  that  RL  can  transfer  in 
conformity  with  customer  desires. 

3.  Persuading  customers  to  acquire  and  adopt  RL's  products,  know-how  and  services. 

4.  Displaying,  moving  and  to  some  extent  storing  products,  know-how  and  services  after  they  have 
been  developed  at  RL. 




5.  Identifying  dual  use  technologies  and  applications. 

In  designing  its  products,  acquiring  and  developing  know-how,  and  deciding  how  much  and  what  specific 
services  to  offer,  RL  will  unquestionably  benefit  from  having  a  clear  picture  of  its  target  customer.  The 
present  project  attempts  to  provide  a  linkage,  an  electronic  marketing  channel,  between  these  target 
customers  and  RL. 

B. Diffusion. Diffusion is another conceptualization that guided the study. Diffusion is the social process
by which an innovation is communicated through certain channels over time among members of a social
system (Rogers, 1983). This process clearly resembles the task at hand for RL in the technology transfer
process, i.e., a technology or an innovation within RL needs to be communicated to a set of firms
(members of a social system) within a particular industry. The communication channel chosen for the
present study is that of electronic messaging via the Internet. The authors view such electronic diffusion
as a timely, cost-effective, quick, and target-specific means of reaching the target customer described
above.

Many  other  dimensions  could  be  addressed  here  that  determine  the  speed  with  which  such  diffusion 
occurs.  Obviously  the  quicker  the  target  audience  can  be  reached  and  the  significance  of  the  technology 
to  be  transferred  can  be  communicated,  the  more  effective  the  diffusion  process.  The  rate  of  diffusion  or 
adoption of that technology is, however, determined by the characteristics of the innovation (e.g., the
relative  advantage,  compatibility,  complexity,  trialability  and  observability). 

The  communication  channel  chosen,  electronic  messaging  via  the  Internet,  determines  in  part  the  success 
of  making  a  successful  linkage  with  the  target  customer.  General  diffusion  principles  posit  that,  ideally, 
successful  communication  occurs  in  a  face-to-face  setting  with  the  target  audience  or  customer.  This, 
however,  is  often  impracticable  or  too  expensive.  Substitutes  to  face-to-face  settings  are  chosen  and  the 
mass  media  come  into  play.  The  use  of  mass  media  in  such  efforts,  as  in  advertising,  is  often  a  vague 
undertaking,  as  one  cannot  be  sure  that  one's  message  truly  reaches  the  intended  target.  Often  a 
compromise between the two extremes is desirable, e.g., the use of mass media, followed up by face-to-face
meetings with smaller groups, etc. Electronic messaging via the Internet comes close to this ideal,
i.e., using a cost-effective mass medium and still reaching specific individuals. Moreover, the Internet's use
makes it possible to be interactive with specific individuals within the target group, thus allowing
almost  face-to-face  interaction.  This  potential  for  interactivity  certainly  makes  the  medium  highly  attractive 
as  requests,  needs,  etc.  certainly  can  be  customized.  Moreover,  such  interactivity  makes  possible  the  often 
missing  feedback  in  the  technology  transfer  process.  Such  feedback  allows  us  to  shape  and  additionally 
customize  the  very  next  step  in  the  diffusion  and  communication  process.  Such  customization  is  almost 
impossible  when  viewing  the  diffusion  process  via  advertising  as  a  communication  channel. 

C. Information Retrieval. In an electronic system, such as one using the Internet, it is obvious that the users
can  readily  retrieve  information  once  the  information  is  stored  on  the  system.  There  are  many  data  bases 
available  at  RL  that  could  be  accessed  through  the  Internet,  and  many  of  those  pertain  to  technology 
transfer  and  related  issues.  The  authors  made  every  effort  to  identify  such  data  bases  and  make  them 
available  in  a  format  conducive  to  outside  users,  hopefully  firms  that  might  acquire  a  particular  product, 
know-how  or  service.  The  design  of  such  information,  data  bases,  etc.  often  determines  their  successful 
use.  In  this  sense,  a  customer  or  end-user  orientation  was  applied  in  the  design  of  the  system.  Unless 
such  design  is  inviting,  encouraging,  timely,  informative  and  user-friendly,  the  success  of  the  system  is 
highly  questionable  (Taylor,  1986). 

D. Networking. The topic of networking as a conceptualization embeds all three of the above topics, i.e.,
marketing,  diffusion  and  information  retrieval.  Without  these  three,  networking  would  be  impossible. 
Networking  in  this  sense  goes  beyond  the  traditional  means  of  reaching  a  target  customer.  George 
Kozmetsky,  director  of  the  IC2  Institute,  Austin,  Texas,  stated  at  a  recent  conference  on  commercialization 
of technology from federal laboratories that, "The secret to successful commercialization of technology
developed in the federal labs is networking, networking, networking" (Kozmetsky, 1993). This importance
was  already  demonstrated  in  an  empirical  NSF-funded  study  by  Wigand  focusing  on  technology  transfer 
issues  within  the  microelectronic  industry  and  the  role  of  industry,  government  and  universities  (Wigand 
and  Frankwick,  1989). 

Networking  denotes  interaction,  feedback,  customizing  and  creating  a  dialogue  to  a  degree  that  otherwise 
would not exist. In that sense and in this particular case, RL may conceive of transferring technology, at
least in the beginning stages, as a partially cooperative means with potential target customers within
industry.  Networking  would  suggest  the  creation  of  Listservs,  electronic  bulletin  boards  (discussion 
groups),  direct  electronic  inquiries  that  might  be  shared  with  the  rest  of  likewise  interested  individuals/firms 
around  a  particular  product,  know-how  or  technology.  Such  individuals  within  firms  appear  to  be  quite 
interested  in  such  dialogue  as  a  technology  emerges,  although  it  is  understandable  that  such  dialogue 
would  be  reduced  or  cease  entirely  once  the  technology  becomes  highly  marketable  or  commercializable 
due  to  competitive  reasons.  But  even  if  RL  could  speed  up  the  development  and  diffusion  of  a  technology 
to  such  a  point  and  if  interaction  would  then  discontinue,  RL  still  would  have  provided  a  most  valuable 
service  to  the  development  of  that  technology  and  industry.  Typically,  such  networks  do  not  come  about 
by themselves. Such interaction must be fostered, encouraged, managed and nurtured by knowledgeable
professionals  within  RL,  perhaps  among  the  Technology  Transfer  staff  or  the  Directorate's  Technology 
Transfer  Focal  Points.  Networking,  in  this  sense,  is  conceived  as  the  highest  level  of  technology  transfer 
along  the  electronic  medium  of  using  the  Internet.  Lastly,  networking  as  envisioned  here,  is  characterized 
as  rating  high  in  terms  of  customer  interaction,  as  well  as  high  in  terms  of  information  content,  implying 
that RL has the means to get closer to the customer. Moreover, it is interactive, multifaceted, customized,
and  on  demand  with  time  delay  and  time  zone  shifts  playing  merely  a  minor  role. 

Methodology 

Various  methods  were  employed  to  investigate  and  research  the  task  at  hand.  The  authors  conducted 
interviews  with  numerous  individuals  within  RL  and  with  non-RL  employees.  Interviews  were  conducted 
with the members of the RL Technology Transfer Office staff, including RL elements at Hanscom AFB,
as  well  as  several  engineers  and  scientists.  One  engineer  was  interviewed  in  greater  depth,  as  his  ACT 
technology was highlighted for demonstration purposes on the Internet, i.e., the World Wide Web (WWW)
at  RL.  This  particular  technology  was  demonstrated  to  show  the  various  capabilities  and  cross-linkages 
possible  using  MOSAIC  as  the  software  to  navigate  on  the  WWW.  In  addition,  several  branch  chiefs  were 
interviewed as well as several area company representatives (Kaman, TRW and Booz, Allen & Hamilton).
Moreover, in-depth interviews were conducted with the officers of the Photonics Development Corporation
in Rome, NY, as well as members of the RL Library. The authors interviewed appropriate, selected private
sector firms in terms of their willingness, availability and readiness to enter the Internet and WWW world.
The effects of the interviews were surprising to some extent as we saw a snowball effect emerging. That
is, the discussion of our interests in the electronic transfer process generated numerous additional
suggestions,  existing  and  potential  services,  as  well  as  data  bases.  All  in  all,  a  surprising  synergy  was 
created  between  the  research  team  and  RL  members  engaged  in  research  and  technology  transfer. 

The Internet and the World Wide Web

The Internet has grown very rapidly over the last several years. No one knows for sure the
exact number of users, as the Internet itself is an organization of loosely coupled networks. An August 4,
1994 report of the Internet Society, the organizing body representing the Internet, reports dramatic growth
in 1994. The latest Internet Society measurements reveal that there are 3.2 million reachable
machines worldwide. This is an increase of 81 percent for the past year and represents an even steeper than normal
increase over the past six months. Indeed, one million new hosts were added during the first six months
of 1994. Of the worldwide total, U.S. connections account for 63 percent (Internet Society, 1994).
The fastest growing segment of the Internet at present is business, again demonstrating the potential
to reach the private sector via the Internet. Although it is difficult to specify a threshold at which a critical
mass of business Internet users has been reached, it is possible today to reach most major firms via the
Internet. Small and medium sized firms are representative of the vast majority of firms signing on to the
Internet (cf., e.g., "Going I-way?" 1994; McBride, 1994).
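
The growth figures quoted above imply a few further numbers that are easy to derive. The short calculation below is a back-of-the-envelope sketch; the derived values are approximations computed here from the quoted figures, not additional Internet Society measurements, and the variable names are illustrative.

```python
# Back-of-the-envelope check of the host counts quoted above
# (Internet Society, 1994). Derived figures are approximations.

HOSTS_MID_1994 = 3.2e6       # reachable machines worldwide
ANNUAL_GROWTH = 0.81         # 81 percent increase over the past year
NEW_HOSTS_H1_1994 = 1.0e6    # hosts added in the first six months of 1994
US_SHARE = 0.63              # U.S. share of the worldwide total

hosts_mid_1993 = HOSTS_MID_1994 / (1 + ANNUAL_GROWTH)   # implied count a year earlier
added_over_year = HOSTS_MID_1994 - hosts_mid_1993
added_h2_1993 = added_over_year - NEW_HOSTS_H1_1994     # implied second-half 1993 additions
us_hosts = US_SHARE * HOSTS_MID_1994

print(f"implied hosts one year earlier: {hosts_mid_1993 / 1e6:.2f} million")
print(f"hosts added over the year:      {added_over_year / 1e6:.2f} million")
print(f"of which in second half 1993:   {added_h2_1993 / 1e6:.2f} million")
print(f"U.S. hosts:                     {us_hosts / 1e6:.2f} million")
```

On these figures, more hosts were added in the first half of 1994 than in the preceding six months, which is consistent with the "steeper than normal" increase described in the report.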

An important caveat needs to be expressed here: just getting a presence on the Internet is not enough.
Just putting one's server on the Internet is analogous to hanging a shingle on a rural road. If one is selling
something and has something important to offer that is intended to reach a large audience, it is important
to have not just a shingle, but a major advertising board on the main drag, i.e., a major highway.

The  Internet's  rapid  proliferation  suggests  that  organizations  in  the  public  and  private  sectors  want  to  take 
advantage  of  the  Internet's  interactive  or  two-way  capabilities.  In  turn,  this  suggests  the  establishment  and 
exploitation of visible and invisible social and physical networks linking and bridging public and private
sector organizations.

The  appropriate  use  of  the  Internet  may  result  in  such  desirable  outcomes  as:  obtaining  support  from 
vendors,  participating  in  joint  development,  collaborative  research  projects,  providing  customer  support, 
marketing and product/service/know-how distribution. Due to space limitations it is impossible here to
highlight various features of a value chain derived from appropriate Internet use. This value chain can be
segmented into internal operations, inputs from suppliers, and customer relations. The Internet, in
conjunction with other appropriate technology and software, such as the World Wide Web and MOSAIC,
lends itself ideally to providing an effective, quick and responsive interactive linkage between RL and its
customers, i.e., various private sector firms (cf., e.g., Blattberg et al., 1994; Dallaire, 1993). Having
a  home  page  on  the  World  Wide  Web  is  not  sufficient;  the  information  content  must  reflect  the  potential 
user's  information  need.  The  authors  have  incorporated  into  the  technology  transfer  home  page  the 
information  that  is  most  sought  after  by  potential  adopters  of  RL’s  technologies.  The  nature  of  this  linkage 
is  ideal  in  terms  of  cost-effectiveness  and  time-based  considerations.  Moreover,  it  provides  the  potential 
for  feedback,  the  essential  component  in  any  effective  communication  effort. 

Data  Bases  and  Electronic  Features  Available  Via  the  Internet  and  WWW 

During the summer months of 1994 the authors designed, built and applied various data bases and services
of interest in the technology transfer process. As mentioned initially, our efforts were driven by an overall
customer/user orientation. The designed system uses MOSAIC on the WWW, is fully accessible to outside
(i.e., non-RL) users as well as RL users, and features the following:

What’s  Happening  in  Tech  Transfer  at  Rome  Lab? 

Technology  Transfer  at  Rome  Laboratory 
Hot  Products 

Technology  Transfer  Inquiries 
A  Model  CRDA 
CRDA  Data  Base 
Educational  Partnerships 
Fact Sheets of Directorates


Facilities 

Skills  at  Rome  Laboratory  Database 
Rome  Laboratory  Staff  Directory 
The  Reliability  Engineer's  Tool  Kit 
Patents  Granted  at  Rome  Laboratory 

The management, use and ownership of these databases pose an intriguing question. It is very important
that the databases are kept up to date and that current-interest features such as Hot Products and What's
Happening in Tech Transfer at Rome Laboratory present products and information that truly are current.
Otherwise, the system will be of little use to outside users, and RL users will quickly recognize that
it is not a timely means of finding information about RL and its technologies.

The authors suggest that ownership of this system reside with the Rome Laboratory Technology
Transfer Office. Access to each feature should be categorized as available to (1) all users
(including the private sector), (2) only the Rome Laboratory Technology Transfer staff, or (3) only the RL
Technology Transfer staff and all RL scientists and engineers. We recommend that the gatekeepers of
information, as well as those who enter information into the system, be the various Technology Transfer
Focal Points within each directorate.
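
The three access categories recommended above can be sketched as a small lookup table. This is only an illustration under stated assumptions: the names Role, Audience, ALLOWED and can_view are hypothetical, and the report does not say which feature belongs to which category.

```python
from enum import Enum


class Role(Enum):
    """Kinds of users named in the text (the identifiers are hypothetical)."""
    OUTSIDE_USER = "private sector or other non-RL user"
    RL_SCIENTIST_ENGINEER = "RL scientist or engineer"
    TT_STAFF = "RL Technology Transfer staff"


class Audience(Enum):
    """The three access categories suggested by the authors."""
    ALL_USERS = 1
    TT_STAFF_ONLY = 2
    TT_STAFF_AND_RL_SE = 3


# Which roles may view a feature tagged with each audience category.
ALLOWED = {
    Audience.ALL_USERS: {Role.OUTSIDE_USER, Role.RL_SCIENTIST_ENGINEER, Role.TT_STAFF},
    Audience.TT_STAFF_ONLY: {Role.TT_STAFF},
    Audience.TT_STAFF_AND_RL_SE: {Role.RL_SCIENTIST_ENGINEER, Role.TT_STAFF},
}


def can_view(role: Role, audience: Audience) -> bool:
    """Return True if a user with this role may view a feature in this category."""
    return role in ALLOWED[audience]
```

A Focal Point acting as gatekeeper would then tag each feature with one Audience value when entering it into the system.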


The system designed clearly demonstrates that technology transfer capabilities can be achieved via
electronic means. This elegant design derives value for Rome Laboratory through the timely and
appropriate alignment of three elements: the information technology already in place (personal computers,
high-speed networks, software such as MOSAIC, and access to the Internet and WWW); Rome Laboratory's goals and
strategies with regard to technology transfer and its larger role within the Armed Forces and society; and
existing business processes, especially those concerned with technology
transfer. Through this alignment and organizational fit, it is possible to generate added value
for Rome Laboratory. It is of considerable importance, though, to realize that this electronic process
cannot be interrupted by peripheral traditional processes, such as paper-based ones,
while the value chain continues to deliver the same value. Appropriate electronic alignment
must occur in such peripheral processes as well.

One must also realize that implementing this effort, even though highly cost-effective compared to
more traditional means, e.g., advertising, does not come cheaply. The system as conceived must be taken
on by an individual or a team knowledgeable about the technology transfer process, RL, and private
sector operating practices; some technology background and competence would probably be helpful
as well. The task at hand is not just mounting the system on the WWW; the effort must also be
nurtured, customized and mothered. In particular, involving key individuals in private sector firms
with an interest in technology transfer in such electronic organizational forms as Bulletin Boards, Listservs,
etc. requires sensitivity, understanding and a sense of managing groups from a group dynamics
perspective. This is time-consuming and requires patience in real life, and even more so when done
electronically. Nevertheless, numerous examples demonstrate success in such efforts.
Abstract

We  have  measured  and  compared  the  temperature  dependence  of  THz 
radiation  induced  by  ultrafast  photocarrier  transport  and  optical  rectification  via 
trans-bandgap femtosecond optical excitation from a single <111> oriented GaAs
crystal.  We  report  our  extended  measurement  of  the  temperature  dependence  of 
THz  emission  from  a  single  <111>  oriented  semi-insulating  GaAs  crystal  under 
femtosecond  trans-resonant  optical  excitation.  Specifically,  we  report  on  the 
temperature  effects  in  the  THz  radiation  resulting  separately  from  optical 
rectification  or  ultrafast  photocarrier  transport,  as  a  function  of  the  incident 
excitation  wavelength.  In  particular,  we  observed  enhanced  THz  emission  from 
both  radiation  mechanisms  when  the  incident  excitation  wavelength  was  tuned 
above  the  temperature  dependent  optical  absorption  bandedge. 
Temperature Dependence of THz Emission from <111>
GaAs  via  Trans-Resonant  Excitation 


X.-C.  Zhang 


Introduction 

Previously,  an  initial  time-resolved  measurement  of  the  temperature 
dependence  of  optically  induced  femtosecond  electromagnetic  radiation  from  several 
bulk  semiconductors  was  performed,  in  which  a  significant  increase  in  the  peak 
radiated  field  was  observed  when  the  sample  temperature  was  reduced  [1].  Since,  at 
modest  incident  optical  fluence,  photocarrier  mobility  in  bulk  semiconductors 
depends  on  temperature,  the  increase  in  THz  radiation  was  attributed  to 
temperature  dependent  carrier  mobility  effects.  However,  due  to  the  off-normal 
incidence  of  the  laser  excitation  used  therein,  the  detected  signal  was  a 
superposition  of  radiation  produced  via  both  optical  rectification  and  ultrafast 
photocarrier  transport.  Furthermore,  due  to  the  limited  tunability  of  the  previous 
laser  source,  a  balanced  colliding  pulse  mode-locked  dye  laser,  a  measurement  of  the 
dispersion  in  THz  emission  as  a  function  of  temperature  was  not  provided. 
Currently, a critical lack of information about the temperature dependence of
the radiation produced by optical rectification and ultrafast photocarrier transport
limits our understanding of the mechanisms underlying these phenomena. A
very  recent  study  of  infrared  wavelength  and  temperature  dependence  of  optically 
induced  THz  radiation  from  InSb  demonstrated  a  substantial  change  in  the  THz 
emission  spectra  between  doped  and  undoped  InSb  at  low  temperatures  [2],  an 
effect  attributed  to  impurity  scattering  in  the  doped  sample. 

Experimental Setup

Our experimental setup for the temperature dependent measurement has
been fully described previously [1]. The important differences between the
experimental arrangement used previously and the one used here are that here we used:
(1)  a  single  <111>  oriented  GaAs  crystal  for  both  the  optical  rectification  and 
ultrafast  photocarrier  transport  measurements,  (2)  a  tunable  Ti:sapphire  laser,  and 
(3) normal sample illumination. This allowed us to make consistent and separate
measurements  of  the  two  radiation  effects,  thus  eliminating  systematic  errors 
resulting  from  using  different  sample  crystals.  Fig.  1  schematically  illustrates  the 
relative  sample  orientations  used  during  the  two  experiments.  Normal  optical 
incidence  was  maintained  during  both  measurements,  and  the  incident  laser 
polarization  was  fixed  parallel  to  the  dipole  axis  of  the  detecting  antenna. 

Experimental  Results 

THz  radiation  can  be  optically  induced  when  certain  materials  are 
illuminated  by  ultrashort  laser  pulses.  The  two  major  radiation  mechanisms  which 
have  been  widely  studied  recently  are  (1)  ultrafast  photocarrier  transport  in  bulk 
semiconductors and (2) THz optical rectification in non-centrosymmetric crystals.
THz optical rectification is a special case of nearly degenerate difference-frequency
generation where the beating frequency ranges from near DC to several THz.
Normal illumination of a <111> oriented zincblende crystal can be used to
distinguish THz emission resulting from ultrafast photocarrier transport from that
due to optical rectification [3]. Under this arrangement, in the absence of an applied
magnetic  field,  the  forward  emitted  THz  radiation  results  only  from  optical 
rectification.  However,  when  a  magnetic  field  is  applied,  the  path  of  the  transient 
photocarriers  is  bent  via  the  Lorentz  force,  so  that  a  component  of  the  THz  radiation 
produced  in  this  way  propagates  in  the  forward  direction.  Thus,  by  rotating  the 
azimuthal  angle  of  the  crystal  to  eliminate  THz  emission  via  optical  rectification,  we 
may  separately  observe  the  THz  emission  resulting  solely  from  ultrafast 
photocarrier  transport.  In  addition,  very  recently  the  two  contributions  to  the  THz 
radiation  induced  by  10  fs  laser  pulses  were  also  resolved  in  the  time-domain  [4]. 
This  was  possible  because  whereas  THz  optical  rectification  results  from  an 
instantaneous  virtual  excitation,  ultrafast  photocarrier  transport  extends  over 
several  hundreds  of  femtoseconds. 
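
To make the difference-frequency picture above concrete, the beat frequency between two spectral components of the same optical pulse can be estimated directly. This is a minimal sketch: the function name and the example wavelengths (820 nm and 825 nm) are illustrative assumptions, not values taken from the measurement.

```python
C_M_PER_S = 299_792_458.0  # speed of light in vacuum


def beat_frequency_thz(lambda1_nm: float, lambda2_nm: float) -> float:
    """Difference frequency, in THz, between two optical wavelengths given in nm."""
    f1_hz = C_M_PER_S / (lambda1_nm * 1e-9)
    f2_hz = C_M_PER_S / (lambda2_nm * 1e-9)
    return abs(f1_hz - f2_hz) / 1e12


# Two components 5 nm apart near 820 nm beat at roughly 2 THz,
# while identical components beat at DC (0 THz).
print(beat_frequency_thz(820.0, 825.0))
print(beat_frequency_thz(820.0, 820.0))
```

Because a femtosecond pulse contains a continuum of such component pairs, the mixing products span the range from near DC to several THz described in the text.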

For  the  optical  rectification  measurement,  as  shown  in  Fig.  1(a),  the  <111> 
oriented crystal was aligned with its [112] axis parallel to the direction of the laser
beam  polarization.  This  geometry  maximizes  THz  radiation  via  optical  rectification 
in  the  forward  direction,  while  as  discussed  above,  normal  optical  incidence  in  the 
absence  of  an  applied  magnetic  field  prevents  THz  radiation  via  ultrafast 
photocarrier  transport  from  propagating  in  the  forward  direction. 
Fig. 1(b) shows the sample rotation used for the transient photocurrent
portion of the measurement. The [112] axis was rotated 30 degrees with respect to
the incident optical polarization, whereby, following crystal symmetry, the THz
optical rectification signal was reduced to nominally zero. To produce THz radiation via ultrafast
photocarrier transport in the forward direction, an external magnetic field of
approximately 0.1 Tesla was applied to the sample. This imposed a Lorentz force on
the photocarriers, producing a transverse component of the THz signal that
propagated in the forward direction [5]. This ensured that the THz signal detected in
this  manner  resulted  solely  from  transient  photocurrents.  Overall,  using  such  an 
arrangement,  as  shown  in  Fig.  1(a)  and  (b),  we  could  directly  determine  and 
separately  measure  the  THz  radiation  resulting  from  each  radiation  mechanism. 
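
The effect of the 30 degree rotation can be illustrated with a simple angular model. As a sketch, assume the detected optical rectification field from a <111> zincblende crystal at normal incidence varies as cos(3θ), where θ is the azimuthal angle between the [112] axis and the pump polarization. This functional form is an assumption chosen to reproduce the maximum at 0 degrees and the null at 30 degrees described in the text; it is not fitted to the measured data, and the function name and unit amplitude are illustrative.

```python
import math


def or_signal(theta_deg: float, amplitude: float = 1.0) -> float:
    """Assumed cos(3*theta) azimuthal dependence of the optical rectification signal."""
    return amplitude * math.cos(math.radians(3.0 * theta_deg))


# Signal at the orientation of Fig. 1(a), at the 30 degree rotation of
# Fig. 1(b), and at 60 and 120 degrees.
for theta in (0.0, 30.0, 60.0, 120.0):
    print(theta, round(or_signal(theta), 6))
```

In this model the signal is maximal at 0 degrees, vanishes at 30 degrees, reverses sign at 60 degrees, and repeats every 120 degrees.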

To  test  the  symmetry  of  our  <111>  GaAs  crystal,  we  first  measured  the  peak 
value  of  THz  optical  rectification  signal  versus  azimuthal  angle  and  incident 
excitation  wavelength.  Fig.  2  displays  the  trans-resonant  dispersion  in  the  peak 
THz  optical  rectification  as  a  function  of  azimuthal  angle  at  room  temperature.  A 
three-fold  rotation  symmetry  in  the  signal  clearly  indicates  that  THz  optical 
rectification via a bulk process is the dominant radiation mechanism. The trans-resonant
flip in polarity of the signal, as the incident excitation wavelength is tuned
through the optical absorption bandedge, at a constant azimuthal angle, is a result
of the optical resonance contribution in Re[χ(2)]. This measurement is consistent
with our separate previous measurement of the effects of either crystal symmetry or
incident trans-resonant optical wavelength on THz emission by optical rectification
[2], but now includes both. We also measured the angular dependence of THz
radiation emitted by ultrafast photocarrier transport. To within our instrument
resolution, we confirmed that this component of the THz radiation, at moderate
optical fluence, is independent of azimuthal angle. This result is consistent with the
spherical symmetry, in reciprocal space, of the resonant electronic states near the Γ
point in zincblende crystals.

Figs.  3  and  4  show  the  temporal  waveforms  radiated  via  THz  optical 
rectification and by ultrafast photocarrier transport, respectively, versus trans-resonant
optical excitation at 90 K. In Fig. 3, we further observe that the entire
time-resolved  THz  optical  rectification  signal  undergoes  a  polarity  flip  as  the 
incident  excitation  wavelength  is  tuned  through  the  optical  absorption  bandedge.  In 
contrast,  since  THz  emission  via  ultrafast  photocarrier  transport  can  only  occur 
under resonant optical excitation, in Fig. 4, no trans-resonant polarity flip can be
observed. 

Fig.  5  displays  the  peak  radiation  signal  via  THz  optical  rectification  versus 
incident  excitation  wavelength  and  sample  temperature.  The  measurement  was 
performed  at  normal  incidence,  in  the  absence  of  an  applied  magnetic  field  to 
eliminate  THz  emission  from  ultrafast  photocarrier  transport,  and  at  a  fixed 
azimuthal  angle  which  oriented  the  crystal  symmetry  to  maximize  THz  optical 
rectification.  The  resonance  feature  shifts  with  the  temperature  dependent  optical 
absorption  edge,  and  the  rising  edge  of  the  resonance  peak  is  sharper  at  lower 
temperatures.  Similarly  to  Fig.  5,  Fig.  6  plots  the  THz  radiation  from  ultrafast 
photocarrier  transport.  In  this  plot,  no  THz  radiation  was  measured  when  the 
incident  photon  energy  was  tuned  below  the  optical  absorption  bandedge. 
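
The shift of the absorption edge with temperature can be estimated with the empirical Varshni relation, Eg(T) = Eg(0) - αT²/(T + β). The sketch below uses parameter values commonly quoted in the literature for GaAs (Eg(0) = 1.519 eV, α = 5.405e-4 eV/K, β = 204 K); these values and the function names are assumptions introduced here and are not taken from this report.

```python
HC_EV_NM = 1239.84193  # photon energy (eV) times wavelength (nm)


def gaas_bandgap_ev(t_kelvin: float,
                    eg0_ev: float = 1.519,
                    alpha_ev_per_k: float = 5.405e-4,
                    beta_k: float = 204.0) -> float:
    """Varshni estimate of the GaAs bandgap in eV (assumed literature parameters)."""
    return eg0_ev - alpha_ev_per_k * t_kelvin ** 2 / (t_kelvin + beta_k)


def bandedge_wavelength_nm(t_kelvin: float) -> float:
    """Wavelength in nm corresponding to the estimated bandgap."""
    return HC_EV_NM / gaas_bandgap_ev(t_kelvin)


# Compare the two ends of the 300 K to 90 K range used in the measurement.
for t in (300.0, 90.0):
    print(t, round(gaas_bandgap_ev(t), 4), round(bandedge_wavelength_nm(t), 1))
```

Under these assumptions the band edge moves from about 1.42 eV at 300 K to about 1.50 eV at 90 K, that is, toward shorter wavelength as the sample is cooled.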

Currently, there is considerable interest in comparing the radiation
efficiency of ultrafast photocarrier transport and THz optical rectification.
Previously, most comparisons of radiation efficiency from the two mechanisms were
made using different samples. However, this ratio is extremely sensitive to the
sample preparation, as it depends on both the surface condition and the
strength of the applied magnetic field. Therefore, our single crystal approach
provides a much more reliable comparison between the radiation efficiencies of the
two mechanisms. In this experiment, at an incident excitation wavelength of 820
nm (1.51 eV) and a magnetic field strength of 0.1 Tesla, we found that, for our
single <111> SI-GaAs crystal, the ratio of the magnitude of THz emission due to ultrafast
photocarrier transport to that due to optical rectification increased from 3 to 30
as the sample temperature was decreased from 300 to 90 K. The room
temperature ratio is consistent with a measurement made previously [5].
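
The wavelength and photon energy quoted above can be cross-checked with the relation E = hc/λ. This is a minimal sketch; the function name is illustrative and the constant is the standard value of hc expressed in eV nm.

```python
HC_EV_NM = 1239.84193  # Planck constant times speed of light, in eV nm


def photon_energy_ev(wavelength_nm: float) -> float:
    """Photon energy in eV for a vacuum wavelength given in nm."""
    return HC_EV_NM / wavelength_nm


# 820 nm corresponds to about 1.51 eV, matching the value quoted in the text.
print(round(photon_energy_ev(820.0), 2))
```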

Conclusion 

We  present  a  detailed  measurement  of  the  trans-resonant  dispersion  and 
temperature  dependence,  from  300  to  90  K,  of  THz  emission  via  both  optical 
rectification  and  ultrafast  photocarrier  transport  from  a  single  <111>  GaAs  crystal. 
Our  results  indicate  that  there  is  both  an  enhancement  of  the  resonant  second-order 
nonlinear  susceptibility  near  the  optical  absorption  bandedge  and  a  significant 
increase  in  photocarrier  mobility  at  low  temperatures. 
Figure Captions:

Fig. 1. Schematic illustration of sample geometry for forward THz emission via:
(a) THz optical rectification and (b) ultrafast photocarrier transport under
an applied magnetic field. The direction of the [112] axis was rotated 30
degrees between (a) and (b).

Fig. 2. Peak value of THz optical rectification versus incident excitation
wavelength and azimuthal angle. The surface gray-level, ranging from
black to white, corresponds to a normalized signal magnitude ranging
from zero to the extremum value, respectively.

Fig. 3. Temporal waveform of THz emission via optical rectification versus
incident excitation wavelength at 90 K.

Fig. 4. Temporal waveform of THz emission via ultrafast photocarrier transport
versus incident excitation wavelength at 90 K.

Fig. 5. Peak value of THz emission via optical rectification versus incident
excitation wavelength and sample temperature. Note: the sign of the data
has been reversed, with respect to Fig. 3, to improve clarity.

Fig. 6. Peak value of THz emission via ultrafast photocarrier transport versus
incident excitation wavelength and sample temperature. Note: the sign of
the data has been reversed, with respect to Fig. 4, to improve clarity.